DataFrame
NO COMPARISON TARGET
100000
ROWS
0
DUPLICATES
105.7 MB
RAM
88
FEATURES
40
CATEGORICAL
44
NUMERICAL
4
TEXT
2.3.1
Get updates, docs & report issues here

Created & maintained by Francois Bertrand
Graphic design by Jean-Francois Hains
target_encoded
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
4
(<1%)
ZEROES:
63,579
(64%)
MAX
3.00
95%
1.00
Q3
1.00
AVG
0.43
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
3.00
IQR
1.00
STD
0.649
VAR
0.422
KURT.
3.31
SKEW
1.68
SUM
43,141
1
begins_with
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
2
css_pk
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
21,608
(22%)
156
<1%
-
-
3421889
152
<1%
-
-
5198111
116
<1%
-
-
4151547
114
<1%
-
-
3684862
114
<1%
-
-
3134707
112
<1%
-
-
5338878
112
<1%
-
-
4310017
99,124
>99%
-
-
(Other)
3
customer_pk
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,129
(1%)
ZEROES:
---
MAX
5,090
95%
4,089
Q3
3,020
AVG
2,088
MEDIAN
1,861
Q1
1,118
5%
287
MIN
24
RANGE
5,066
IQR
1,902
STD
1,168
VAR
1.4M
KURT.
-0.816
SKEW
0.304
SUM
208.8M
4
is_italic
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
5
is_bold
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
6
html_pk
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
21,608
(22%)
156
<1%
-
-
3421890
152
<1%
-
-
5198112
116
<1%
-
-
4151660
114
<1%
-
-
3684965
114
<1%
-
-
3134708
112
<1%
-
-
5338884
112
<1%
-
-
4310018
99,124
>99%
-
-
(Other)
7
id
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
100,000
(100%)
1
<1%
-
-
384942|5169609|69D269D9
1
<1%
-
-
245324|3561143|3A7780D4
1
<1%
-
-
400757|5351822|038D3302
1
<1%
-
-
376043|5066332|370C7904
1
<1%
-
-
427401|5676925|74B85401
1
<1%
-
-
379048|5102997|52FC1195
1
<1%
-
-
388548|5208639|5900132E
99,993
>99%
-
-
(Other)
8
is_underline
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
9
target
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
4
(<1%)
10
form_rel_depth
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
240
(<1%)
ZEROES:
---
MAX
615
95%
5
Q3
4
AVG
4
MEDIAN
2
Q1
1
5%
1
MIN
1
RANGE
614
IQR
3.00
STD
11.5
VAR
132
KURT.
791
SKEW
23.7
SUM
359k
11
form_rel_font_size
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
47
(<1%)
ZEROES:
---
MAX
74.0
95%
8.0
Q3
4.0
AVG
3.1
MEDIAN
2.0
Q1
1.0
5%
1.0
MIN
1.0
RANGE
73.0
IQR
3.00
STD
2.73
VAR
7.45
KURT.
28.4
SKEW
3.13
SUM
308k
12
form_font_family_mode_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
13
form_font_colour_mode_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
14
lang_num_sents
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
166
(<1%)
ZEROES:
---
MAX
400
95%
3
Q3
1
AVG
2
MEDIAN
1
Q1
1
5%
1
MIN
1
RANGE
399
IQR
0.00
STD
7.51
VAR
56.3
KURT.
1,213
SKEW
31.5
SUM
159k
15
lang_num_words
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
669
(<1%)
ZEROES:
---
MAX
14,924
95%
57
Q3
19
AVG
25
MEDIAN
8
Q1
3
5%
1
MIN
1
RANGE
14,923
IQR
16.0
STD
216
VAR
46,711
KURT.
1,491
SKEW
34.0
SUM
2.5M
16
lang_mean_words_per_sent
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,114
(1%)
ZEROES:
---
MAX
650
95%
31
Q3
16
AVG
11
MEDIAN
7
Q1
3
5%
1
MIN
1
RANGE
649
IQR
12.5
STD
11.8
VAR
139
KURT.
118
SKEW
4.93
SUM
1.1M
17
lang_ls_alnum
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
18
lang_ls_qm
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
19
lang_ls_fs
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
20
lang_ls_clscl
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
21
lang_ls_brkt
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
22
para_prec_depth_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
3
(<1%)
23
para_foll_depth_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
3
(<1%)
24
para_prec_size_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
3
(<1%)
25
para_foll_size_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
3
(<1%)
26
para_prec_bold_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
27
para_foll_bold_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
28
para_prec_italic_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
29
para_foll_italic_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
30
para_prec_underline_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
31
para_foll_underline_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
32
para_prec_font_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
33
para_foll_font_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
34
para_prec_colour_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
35
para_foll_colour_ind
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
36
is_upper
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
37
is_title
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
38
style
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2,387
(2%)
50,364
50%
-
-
default
14,284
14%
-
-
listparagraph
5,120
5%
-
-
tableparagraph
3,395
3%
-
-
bodytext
2,971
3%
-
-
normal
2,000
2%
-
-
heading2
1,215
1%
-
-
heading1
20,651
21%
-
-
(Other)
39
style_bullet
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
40
style_table
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
41
style_list_num
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
42
style_heading
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
43
style_box
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
44
style_toc
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
45
style_q
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
46
style_ans
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
47
style_title
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
48
style_indent
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
49
style_cover_nm_add
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
50
style_head_foot
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2
(<1%)
51
lang_pct_coordinating_conjunction
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,231
(1%)
ZEROES:
67,567
(68%)
MAX
1.00
95%
0.12
Q3
0.04
AVG
0.03
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.041
STD
0.053
VAR
0.003
KURT.
22.5
SKEW
3.42
SUM
2,657
52
lang_pct_cardinal_digit
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,026
(1%)
ZEROES:
83,982
(84%)
MAX
1.00
95%
0.20
Q3
0.00
AVG
0.03
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.095
VAR
0.009
KURT.
33.9
SKEW
5.16
SUM
2,736
53
lang_pct_determiner
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,553
(2%)
ZEROES:
58,122
(58%)
MAX
1.00
95%
0.19
Q3
0.09
AVG
0.05
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.091
STD
0.109
VAR
0.012
KURT.
40.6
SKEW
5.37
SUM
5,498
54
lang_pct_existential_there
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
377
(<1%)
ZEROES:
98,432
(98%)
MAX
0.200
95%
0.000
Q3
0.000
AVG
0.001
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.200
IQR
0.00
STD
0.007
VAR
4.88e-5
KURT.
214
SKEW
13.3
SUM
68.1
55
lang_pct_foreign_word
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
578
(<1%)
ZEROES:
97,627
(98%)
MAX
0.750
95%
0.000
Q3
0.000
AVG
0.002
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.750
IQR
0.00
STD
0.018
VAR
3.13e-4
KURT.
483
SKEW
19.4
SUM
162
56
lang_pct_preposition_subordinating_conjunction
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,498
(1%)
ZEROES:
52,650
(53%)
MAX
1.00
95%
0.20
Q3
0.11
AVG
0.06
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.107
STD
0.079
VAR
0.006
KURT.
10.5
SKEW
2.05
SUM
5,885
57
lang_pct_adjective
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,427
(1%)
ZEROES:
55,272
(55%)
MAX
1.00
95%
0.25
Q3
0.08
AVG
0.06
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.083
STD
0.123
VAR
0.015
KURT.
24.3
SKEW
4.20
SUM
6,309
58
lang_pct_adjective_comparative
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
516
(<1%)
ZEROES:
98,228
(98%)
MAX
0.500
95%
0.000
Q3
0.000
AVG
0.001
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.500
IQR
0.00
STD
0.010
VAR
1.07e-4
KURT.
1,130
SKEW
28.9
SUM
74.9
59
lang_pct_adjective_superlative
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
511
(<1%)
ZEROES:
98,222
(98%)
MAX
1.00
95%
0.00
Q3
0.00
AVG
0.00
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.011
VAR
1.30e-4
KURT.
3,386
SKEW
47.1
SUM
74.3
60
lang_pct_list_marker
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
21
(<1%)
ZEROES:
99,974
(>99%)
MAX
0.333
95%
0.000
Q3
0.000
AVG
0.000
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.333
IQR
0.00
STD
0.002
VAR
4.81e-6
KURT.
13,548
SKEW
108
SUM
2.66
61
lang_pct_modal
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
967
(<1%)
ZEROES:
85,543
(86%)
MAX
1.00
95%
0.05
Q3
0.00
AVG
0.01
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.025
VAR
6.21e-4
KURT.
407
SKEW
12.9
SUM
724
62
lang_pct_noun_singular
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,909
(2%)
ZEROES:
24,943
(25%)
MAX
1.00
95%
1.00
Q3
0.33
AVG
0.25
MEDIAN
0.17
Q1
0.03
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.307
STD
0.282
VAR
0.079
KURT.
1.80
SKEW
1.59
SUM
25,047
63
lang_pct_noun_plural
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,531
(2%)
ZEROES:
52,103
(52%)
MAX
1.00
95%
0.33
Q3
0.10
AVG
0.08
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.100
STD
0.146
VAR
0.021
KURT.
18.2
SKEW
3.77
SUM
7,762
64
lang_pct_proper_noun_singular
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
2,018
(2%)
ZEROES:
41,726
(42%)
MAX
1.00
95%
0.80
Q3
0.25
AVG
0.18
MEDIAN
0.06
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.250
STD
0.262
VAR
0.069
KURT.
2.55
SKEW
1.82
SUM
17,519
65
lang_pct_proper_noun_plural
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
564
(<1%)
ZEROES:
97,673
(98%)
MAX
0.500
95%
0.000
Q3
0.000
AVG
0.002
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.500
IQR
0.00
STD
0.017
VAR
2.96e-4
KURT.
338
SKEW
16.6
SUM
159
66
lang_pct_predeterminer
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
210
(<1%)
ZEROES:
99,622
(>99%)
MAX
0.250
95%
0.000
Q3
0.000
AVG
0.000
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.250
IQR
0.00
STD
0.003
VAR
8.29e-6
KURT.
2,557
SKEW
44.0
SUM
10.9
67
lang_pct_possessive_ending
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
416
(<1%)
ZEROES:
98,576
(99%)
MAX
0.667
95%
0.000
Q3
0.000
AVG
0.001
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.667
IQR
0.00
STD
0.009
VAR
8.02e-5
KURT.
1,251
SKEW
29.2
SUM
63.9
68
lang_pct_personal_pronoun
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
872
(<1%)
ZEROES:
88,414
(88%)
MAX
1.00
95%
0.05
Q3
0.00
AVG
0.01
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.028
VAR
7.64e-4
KURT.
385
SKEW
13.6
SUM
672
69
lang_pct_possessive_pronoun
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
901
(<1%)
ZEROES:
86,301
(86%)
MAX
1.00
95%
0.06
Q3
0.00
AVG
0.01
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.027
VAR
7.27e-4
KURT.
82.4
SKEW
6.50
SUM
798
70
lang_pct_adverb
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,061
(1%)
ZEROES:
83,297
(83%)
MAX
1.00
95%
0.07
Q3
0.00
AVG
0.01
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.047
VAR
0.002
KURT.
135
SKEW
9.54
SUM
1,202
71
lang_pct_adverb_comparative
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
265
(<1%)
ZEROES:
99,505
(>99%)
MAX
0.500
95%
0.000
Q3
0.000
AVG
0.000
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.500
IQR
0.00
STD
0.004
VAR
1.29e-5
KURT.
8,467
SKEW
74.5
SUM
13.0
72
lang_pct_adverb_superlative
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
271
(<1%)
ZEROES:
99,412
(>99%)
MAX
0.333
95%
0.000
Q3
0.000
AVG
0.000
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.333
IQR
0.00
STD
0.004
VAR
1.43e-5
KURT.
2,127
SKEW
37.3
SUM
18.4
73
lang_pct_particle
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
398
(<1%)
ZEROES:
98,638
(99%)
MAX
0.500
95%
0.000
Q3
0.000
AVG
0.001
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.500
IQR
0.00
STD
0.008
VAR
5.94e-5
KURT.
1,651
SKEW
33.2
SUM
53.3
74
lang_pct_to_infinitive_preposition
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,091
(1%)
ZEROES:
77,370
(77%)
MAX
1.00
95%
0.08
Q3
0.00
AVG
0.01
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.033
VAR
0.001
KURT.
26.7
SKEW
3.81
SUM
1,401
75
lang_pct_interjection
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
295
(<1%)
ZEROES:
97,319
(97%)
MAX
1.00
95%
0.00
Q3
0.00
AVG
0.01
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.090
VAR
0.008
KURT.
104
SKEW
10.1
SUM
1,015
76
lang_pct_verb_base_form
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,254
(1%)
ZEROES:
68,228
(68%)
MAX
1.00
95%
0.12
Q3
0.04
AVG
0.03
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.042
STD
0.055
VAR
0.003
KURT.
64.0
SKEW
5.26
SUM
2,672
77
lang_pct_verb_past_tense
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
841
(<1%)
ZEROES:
90,964
(91%)
MAX
1.00
95%
0.04
Q3
0.00
AVG
0.01
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.033
VAR
0.001
KURT.
185
SKEW
11.1
SUM
614
78
lang_pct_verb_gerund_present_participle
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,033
(1%)
ZEROES:
83,477
(83%)
MAX
1.00
95%
0.07
Q3
0.00
AVG
0.01
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.063
VAR
0.004
KURT.
143
SKEW
10.6
SUM
1,359
79
lang_pct_verb_past_participle
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,150
(1%)
ZEROES:
75,799
(76%)
MAX
1.00
95%
0.10
Q3
0.00
AVG
0.02
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.055
VAR
0.003
KURT.
114
SKEW
8.30
SUM
1,870
80
lang_pct_verb_sing_present_non_third_person
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
980
(<1%)
ZEROES:
82,718
(83%)
MAX
0.500
95%
0.071
Q3
0.000
AVG
0.010
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.500
IQR
0.00
STD
0.032
VAR
0.001
KURT.
58.6
SKEW
5.87
SUM
1,047
81
lang_pct_verb_3rd_person_sing_present
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,019
(1%)
ZEROES:
79,279
(79%)
MAX
0.500
95%
0.080
Q3
0.000
AVG
0.013
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.500
IQR
0.00
STD
0.034
VAR
0.001
KURT.
26.0
SKEW
4.16
SUM
1,290
82
lang_pct_wh_determiner
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
717
(<1%)
ZEROES:
93,978
(94%)
MAX
0.333
95%
0.013
Q3
0.000
AVG
0.002
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.333
IQR
0.00
STD
0.010
VAR
1.05e-4
KURT.
76.2
SKEW
7.23
SUM
206
83
lang_pct_wh_pronoun
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
593
(<1%)
ZEROES:
94,390
(94%)
MAX
0.333
95%
0.013
Q3
0.000
AVG
0.003
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.333
IQR
0.00
STD
0.016
VAR
2.63e-4
KURT.
50.4
SKEW
6.46
SUM
321
84
lang_pct_possessive_wh_pronoun
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
85
(<1%)
ZEROES:
99,898
(>99%)
MAX
0.071
95%
0.000
Q3
0.000
AVG
0.000
MEDIAN
0.000
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.071
IQR
0.00
STD
7.62e-4
VAR
5.81e-7
KURT.
3,655
SKEW
55.6
SUM
1.77
85
lang_pct_wh_abverb
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
667
(<1%)
ZEROES:
92,796
(93%)
MAX
1.00
95%
0.03
Q3
0.00
AVG
0.00
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.019
VAR
3.42e-4
KURT.
147
SKEW
7.99
SUM
398
86
lang_pct_punct
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
1,685
(2%)
ZEROES:
39,655
(40%)
MAX
0.970
95%
0.364
Q3
0.150
AVG
0.102
MEDIAN
0.071
Q1
0.000
5%
0.000
MIN
0.000
RANGE
0.970
IQR
0.150
STD
0.125
VAR
0.016
KURT.
2.47
SKEW
1.57
SUM
10,247
87
lang_pct_sym
VALUES:
100,000
(100%)
MISSING:
---
DISTINCT:
32
(<1%)
ZEROES:
99,917
(>99%)
MAX
1.00
95%
0.00
Q3
0.00
AVG
0.00
MEDIAN
0.00
Q1
0.00
5%
0.00
MIN
0.00
RANGE
1.00
IQR
0.00
STD
0.022
VAR
4.66e-4
KURT.
2,111
SKEW
45.8
SUM
49.1
target_encoded
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_wh_pronoun
0.12
lang_pct_possessive_pronoun
0.12
lang_pct_cardinal_digit
-0.11
lang_pct_wh_abverb
0.11
form_rel_font_size
-0.11
lang_pct_verb_base_form
0.10
lang_pct_coordinating_conjunction
0.09
lang_pct_verb_3rd_person_sing_present
0.08
lang_pct_verb_sing_present_non_third_person
0.08
lang_pct_personal_pronoun
0.07
lang_pct_interjection
-0.07
lang_pct_noun_plural
0.06
lang_mean_words_per_sent
0.06
lang_pct_preposition_subordinating_conjunction
0.05

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

target
1.00
lang_ls_qm
0.26
begins_with
0.24
style_heading
0.17
lang_ls_alnum
0.14
is_title
0.13
form_font_family_mode_ind
0.10
style_q
0.09
style_list_num
0.09
is_upper
0.09
style_toc
0.08
style_table
0.07
para_prec_size_ind
0.06
para_foll_size_ind
0.06
MOST FREQUENT VALUES

0
63,579
63.6%
1
31,798
31.8%
2
2,526
2.5%
3
2,097
2.1%
SMALLEST VALUES

0
63,579
63.6%
1
31,798
31.8%
2
2,526
2.5%
3
2,097
2.1%
LARGEST VALUES

3
2,097
2.1%
2
2,526
2.5%
1
31,798
31.8%
0
63,579
63.6%
Associations
[Only including dataset "DataFrame"]
Squares are categorical associations (uncertainty coefficient & correlation ratio) from 0 to 1. The uncertainty coefficient is assymmetrical, (i.e. ROW LABEL values indicate how much they PROVIDE INFORMATION to each LABEL at the TOP).

Circles are the symmetrical numerical correlations (Pearson's) from -1 to 1. The trivial diagonal is intentionally left blank for clarity.
Associations
[Only including dataset "None"]
Squares are categorical associations (uncertainty coefficient & correlation ratio) from 0 to 1. The uncertainty coefficient is assymmetrical, (i.e. ROW LABEL values indicate how much they PROVIDE INFORMATION to each LABEL at the TOP).

Circles are the symmetrical numerical correlations (Pearson's) from -1 to 1. The trivial diagonal is intentionally left blank for clarity.
target_encoded
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_wh_pronoun
0.12
lang_pct_possessive_pronoun
0.12
lang_pct_cardinal_digit
-0.11
lang_pct_wh_abverb
0.11
form_rel_font_size
-0.11
lang_pct_verb_base_form
0.10
lang_pct_coordinating_conjunction
0.09
lang_pct_verb_3rd_person_sing_present
0.08
lang_pct_verb_sing_present_non_third_person
0.08
lang_pct_personal_pronoun
0.07
lang_pct_interjection
-0.07
lang_pct_noun_plural
0.06
lang_mean_words_per_sent
0.06
lang_pct_preposition_subordinating_conjunction
0.05

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

target
1.00
lang_ls_qm
0.26
begins_with
0.24
style_heading
0.17
lang_ls_alnum
0.14
is_title
0.13
form_font_family_mode_ind
0.10
style_q
0.09
style_list_num
0.09
is_upper
0.09
style_toc
0.08
style_table
0.07
para_prec_size_ind
0.06
para_foll_size_ind
0.06
MOST FREQUENT VALUES

0
63,579
63.6%
1
31,798
31.8%
2
2,526
2.5%
3
2,097
2.1%
SMALLEST VALUES

0
63,579
63.6%
1
31,798
31.8%
2
2,526
2.5%
3
2,097
2.1%
LARGEST VALUES

3
2,097
2.1%
2
2,526
2.5%
1
31,798
31.8%
0
63,579
63.6%
begins_with
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
text
69,404
69%
0.328
number
30,596
31%
0.667
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
begins_with
PROVIDES INFORMATION ON...

style_list_num
0.09
target
0.05
style_cover_nm_add
0.04
lang_ls_qm
0.03
style_toc
0.03
style_heading
0.03
para_prec_depth_ind
0.02
para_foll_depth_ind
0.02
style_table
0.02
style_bullet
0.01
style_q
0.01
lang_ls_alnum
0.01
lang_ls_brkt
0.01
is_italic
0.01

THESE FEATURES
GIVE INFORMATION
ON begins_with:

style_list_num
0.06
target
0.06
para_foll_depth_ind
0.02
para_prec_depth_ind
0.02
lang_ls_qm
0.02
lang_ls_alnum
0.01
style_table
0.01
style_heading
0.01
is_title
0.00
style_toc
0.00
lang_ls_brkt
0.00
lang_ls_fs
0.00
form_font_colour_mode_ind
0.00
form_font_family_mode_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
begins_with
CORRELATION RATIO WITH...

target_encoded
0.24
lang_pct_proper_noun_singular
0.10
lang_pct_wh_pronoun
0.10
lang_pct_possessive_pronoun
0.09
lang_pct_wh_abverb
0.07
lang_mean_words_per_sent
0.07
lang_pct_verb_3rd_person_sing_present
0.06
lang_pct_interjection
0.06
lang_pct_coordinating_conjunction
0.05
lang_pct_noun_singular
0.05
lang_pct_cardinal_digit
0.05
lang_pct_verb_base_form
0.04
lang_pct_preposition_subordinating_conjunction
0.04
lang_pct_personal_pronoun
0.04
css_pk
MISSING:
---
156
<1%
-
-
3421889
152
<1%
-
-
5198111
116
<1%
-
-
4151547
114
<1%
-
-
3684862
114
<1%
-
-
3134707
112
<1%
-
-
5338878
112
<1%
-
-
4310017
107
<1%
-
-
5669164
89
<1%
-
-
5446727
88
<1%
-
-
4606586
87
<1%
-
-
5392666
86
<1%
-
-
3811513
81
<1%
-
-
5067506
80
<1%
-
-
4391859
78
<1%
-
-
4395641
75
<1%
-
-
4526371
75
<1%
-
-
4339332
70
<1%
-
-
3857718
70
<1%
-
-
5518761
69
<1%
-
-
4283719
67
<1%
-
-
3216506
66
<1%
-
-
4224937
66
<1%
-
-
5006940
65
<1%
-
-
5318271
64
<1%
-
-
5699475
64
<1%
-
-
4329551
63
<1%
-
-
4330027
63
<1%
-
-
4570000
63
<1%
-
-
4520816
62
<1%
-
-
4128393
62
<1%
-
-
4467887
62
<1%
-
-
5208638
61
<1%
-
-
4646851
60
<1%
-
-
5712686
59
<1%
-
-
4196951
58
<1%
-
-
4146161
57
<1%
-
-
3108162
56
<1%
-
-
4161394
56
<1%
-
-
3030988
56
<1%
-
-
3570069
54
<1%
-
-
3213055
53
<1%
-
-
5532972
52
<1%
-
-
4605599
52
<1%
-
-
4792895
51
<1%
-
-
5721690
51
<1%
-
-
4385200
50
<1%
-
-
3832057
50
<1%
-
-
4760043
50
<1%
-
-
3561069
49
<1%
-
-
5014441
49
<1%
-
-
5727230
48
<1%
-
-
4401506
48
<1%
-
-
5817237
48
<1%
-
-
5540979
48
<1%
-
-
5513739
48
<1%
-
-
4846900
48
<1%
-
-
3346434
46
<1%
-
-
4192639
46
<1%
-
-
3565778
45
<1%
-
-
4277212
95,853
96%
-
-
(Other)
customer_pk
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_num_words
0.03
lang_num_sents
0.03
form_rel_depth
0.03
lang_pct_modal
-0.02
lang_pct_to_infinitive_preposition
-0.01
lang_mean_words_per_sent
0.01
lang_pct_noun_plural
-0.01
lang_pct_sym
-0.01
lang_pct_verb_base_form
-0.01
lang_pct_adverb
0.01
lang_pct_coordinating_conjunction
-0.01
lang_pct_list_marker
-0.01
lang_pct_cardinal_digit
0.01
lang_pct_determiner
0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

style_q
0.02
style_box
0.02
form_font_colour_mode_ind
0.02
style_heading
0.02
style_indent
0.02
para_prec_colour_ind
0.01
target
0.01
is_bold
0.01
para_foll_colour_ind
0.01
style_head_foot
0.01
para_foll_depth_ind
0.01
style_list_num
0.01
para_prec_depth_ind
0.01
style_ans
0.01
MOST FREQUENT VALUES

1549
3,265
3.3%
1024
2,156
2.2%
3423
2,074
2.1%
2999
2,045
2.0%
2131
1,628
1.6%
1093
1,510
1.5%
3264
1,464
1.5%
204
1,360
1.4%
864
1,304
1.3%
1138
1,296
1.3%
1826
1,271
1.3%
2759
1,058
1.1%
3421
1,038
1.0%
1411
1,028
1.0%
878
931
0.9%
SMALLEST VALUES

24
201
0.2%
29
64
<0.1%
34
66
<0.1%
44
64
<0.1%
47
4
<0.1%
54
31
<0.1%
56
187
0.2%
57
1
<0.1%
66
68
<0.1%
73
40
<0.1%
98
44
<0.1%
116
25
<0.1%
120
52
<0.1%
124
1
<0.1%
155
4
<0.1%
LARGEST VALUES

5090
20
<0.1%
5083
18
<0.1%
5082
8
<0.1%
5077
28
<0.1%
5071
13
<0.1%
5063
4
<0.1%
5038
9
<0.1%
5034
2
<0.1%
5031
1
<0.1%
5016
18
<0.1%
5015
7
<0.1%
4984
1
<0.1%
4979
2
<0.1%
4976
11
<0.1%
4962
7
<0.1%
is_italic
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
97,724
98%
0.432
1
2,276
2%
0.398
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
is_italic
PROVIDES INFORMATION ON...

para_foll_italic_ind
0.21
para_prec_italic_ind
0.20
is_underline
0.01
form_font_colour_mode_ind
0.01
style_box
0.00
para_foll_colour_ind
0.00
style_toc
0.00
style_q
0.00
is_bold
0.00
style_cover_nm_add
0.00
para_prec_colour_ind
0.00
para_foll_size_ind
0.00
target
0.00
lang_ls_qm
0.00

THESE FEATURES
GIVE INFORMATION
ON is_italic:

para_foll_italic_ind
0.20
para_prec_italic_ind
0.20
target
0.01
form_font_colour_mode_ind
0.01
is_bold
0.01
para_foll_size_ind
0.01
para_foll_colour_ind
0.01
is_underline
0.01
para_foll_bold_ind
0.01
para_prec_colour_ind
0.01
begins_with
0.01
lang_ls_qm
0.00
para_prec_size_ind
0.00
para_prec_bold_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
is_italic
CORRELATION RATIO WITH...

lang_pct_adverb
0.03
lang_pct_punct
0.03
lang_pct_determiner
0.01
lang_pct_cardinal_digit
0.01
lang_pct_wh_pronoun
0.01
lang_pct_verb_base_form
0.01
lang_pct_personal_pronoun
0.01
lang_pct_verb_3rd_person_sing_present
0.01
lang_pct_wh_abverb
0.01
lang_pct_interjection
0.01
lang_mean_words_per_sent
0.01
target_encoded
0.01
lang_pct_coordinating_conjunction
0.01
lang_pct_preposition_subordinating_conjunction
0.01
is_bold
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
80,909
81%
0.421
1
19,091
19%
0.476
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
is_bold
PROVIDES INFORMATION ON...

para_foll_bold_ind
0.21
para_prec_bold_ind
0.20
is_underline
0.05
style_box
0.03
style_toc
0.03
form_font_colour_mode_ind
0.02
is_title
0.02
target
0.02
is_italic
0.01
style_cover_nm_add
0.01
para_foll_size_ind
0.01
lang_ls_fs
0.01
is_upper
0.01
para_prec_size_ind
0.01

THESE FEATURES
GIVE INFORMATION
ON is_bold:

para_foll_bold_ind
0.21
para_prec_bold_ind
0.20
target
0.03
is_title
0.02
form_font_colour_mode_ind
0.01
lang_ls_fs
0.01
lang_ls_alnum
0.01
para_foll_size_ind
0.01
para_prec_size_ind
0.01
is_underline
0.01
para_prec_depth_ind
0.01
para_prec_colour_ind
0.01
lang_ls_qm
0.01
is_upper
0.01

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
is_bold
CORRELATION RATIO WITH...

lang_mean_words_per_sent
0.12
lang_pct_proper_noun_singular
0.09
lang_pct_verb_base_form
0.09
lang_pct_preposition_subordinating_conjunction
0.09
lang_pct_to_infinitive_preposition
0.08
lang_pct_verb_3rd_person_sing_present
0.08
lang_pct_noun_singular
0.07
lang_pct_modal
0.07
lang_pct_determiner
0.07
lang_pct_verb_sing_present_non_third_person
0.05
form_rel_font_size
0.05
lang_pct_possessive_pronoun
0.05
lang_pct_adverb
0.05
lang_pct_verb_past_participle
0.04
html_pk
MISSING:
---
156
<1%
-
-
3421890
152
<1%
-
-
5198112
116
<1%
-
-
4151660
114
<1%
-
-
3684965
114
<1%
-
-
3134708
112
<1%
-
-
5338884
112
<1%
-
-
4310018
107
<1%
-
-
5669166
89
<1%
-
-
5446736
88
<1%
-
-
4606763
87
<1%
-
-
5392667
86
<1%
-
-
3811521
81
<1%
-
-
5067507
80
<1%
-
-
4391865
78
<1%
-
-
4395642
75
<1%
-
-
4526372
75
<1%
-
-
4339333
70
<1%
-
-
3857719
70
<1%
-
-
5518767
69
<1%
-
-
4283721
67
<1%
-
-
3216773
66
<1%
-
-
4224938
66
<1%
-
-
5006967
65
<1%
-
-
5318272
64
<1%
-
-
5699476
64
<1%
-
-
4329552
63
<1%
-
-
4330028
63
<1%
-
-
4570001
63
<1%
-
-
4520877
62
<1%
-
-
4128399
62
<1%
-
-
4467890
62
<1%
-
-
5208639
61
<1%
-
-
4646853
60
<1%
-
-
5712687
59
<1%
-
-
4196958
58
<1%
-
-
4146166
57
<1%
-
-
3108164
56
<1%
-
-
4161498
56
<1%
-
-
3031087
56
<1%
-
-
3570070
54
<1%
-
-
3213123
53
<1%
-
-
5533059
52
<1%
-
-
4605600
52
<1%
-
-
4792897
51
<1%
-
-
5721691
51
<1%
-
-
4385201
50
<1%
-
-
3832061
50
<1%
-
-
4760044
50
<1%
-
-
3561143
49
<1%
-
-
5014561
49
<1%
-
-
5727241
48
<1%
-
-
4401507
48
<1%
-
-
5817261
48
<1%
-
-
5541036
48
<1%
-
-
5513770
48
<1%
-
-
4846908
48
<1%
-
-
3346442
46
<1%
-
-
4192856
46
<1%
-
-
3565782
45
<1%
-
-
4277214
95,853
96%
-
-
(Other)
id
MISSING:
---
1
<1%
-
-
384942|5169609|69D269D9
1
<1%
-
-
245324|3561143|3A7780D4
1
<1%
-
-
400757|5351822|038D3302
1
<1%
-
-
376043|5066332|370C7904
1
<1%
-
-
427401|5676925|74B85401
1
<1%
-
-
379048|5102997|52FC1195
1
<1%
-
-
388548|5208639|5900132E
1
<1%
-
-
333182|4585736|7EC07987
1
<1%
-
-
400703|5350890|79B219DE
1
<1%
-
-
433274|5732276|03520662
1
<1%
-
-
205865|3134708|119D558A
1
<1%
-
-
380000|5114706|0342E523
1
<1%
-
-
393278|5261045|08C538E9
1
<1%
-
-
306389|4273287|106E6BA9
1
<1%
-
-
331156|4565770|78B0477C
1
<1%
-
-
421605|5569002|28DC3634
1
<1%
-
-
313860|4359047|62228D62
1
<1%
-
-
221818|3309029|7168F840
1
<1%
-
-
348025|4755705|2D93A8EF
1
<1%
-
-
246493|3574320|para_3d
1
<1%
-
-
377687|5086493|3B453824
1
<1%
-
-
439562|5802215|30375A91
1
<1%
-
-
390440|5229158|35474B98
1
<1%
-
-
428119|5686538|0000002F
1
<1%
-
-
204461|3122010|26F65801
1
<1%
-
-
313938|4359588|6856CD11
1
<1%
-
-
416070|5541036|3D72D2A3
1
<1%
-
-
370913|5008277|1EDBA8F6
1
<1%
-
-
321537|4031246|5A786747
1
<1%
-
-
245434|3562008|415ACA3D
1
<1%
-
-
386187|5181975|78AA59AB
1
<1%
-
-
401155|5356336|50527029
1
<1%
-
-
194138|3014027|6E5D444E
1
<1%
-
-
211250|3189784|5A4C7C40
1
<1%
-
-
322360|4451693|0000008B
1
<1%
-
-
331168|4565960|189C819D
1
<1%
-
-
317893|4401172|5116F5DF
1
<1%
-
-
403755|5392667|0E1F102A
1
<1%
-
-
362555|4919800|para_s
1
<1%
-
-
358403|4867162|696075AE
1
<1%
-
-
365271|4950400|57913EEB
1
<1%
-
-
318526|4408436|para_1j
1
<1%
-
-
282569|3991917|415554F7
1
<1%
-
-
355592|4838399|00000190
1
<1%
-
-
219092|3278968|3EA1EA21
1
<1%
-
-
353350|4812425|7310D735
1
<1%
-
-
273412|3887912|554BD7D2
1
<1%
-
-
348510|4762289|0B252FE1
1
<1%
-
-
241079|3518962|7DC058F6
1
<1%
-
-
307414|4286357|7F9E7DC1
1
<1%
-
-
343442|4700045|68153B73
1
<1%
-
-
395667|5290801|580B0B33
1
<1%
-
-
268493|3830556|1D1C82D8
1
<1%
-
-
307442|4287004|para_1a
1
<1%
-
-
195129|3022939|182542C9
1
<1%
-
-
413841|5509863|420E4A90
1
<1%
-
-
203809|3115122|3D901405
1
<1%
-
-
415962|5540036|77A6727C
1
<1%
-
-
379417|5106001|784FDA0A
1
<1%
-
-
382144|5138747|5425E7BC
99,940
>99%
-
-
(Other)
is_underline
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
98,376
98%
0.428
1
1,624
2%
0.608
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
is_underline
PROVIDES INFORMATION ON...

para_prec_underline_ind
0.14
para_foll_underline_ind
0.13
is_bold
0.01
style_title
0.01
is_italic
0.01
target
0.01
lang_ls_qm
0.00
para_foll_depth_ind
0.00
form_font_colour_mode_ind
0.00
style_cover_nm_add
0.00
para_prec_depth_ind
0.00
para_foll_size_ind
0.00
para_prec_colour_ind
0.00
style_q
0.00

THESE FEATURES
GIVE INFORMATION
ON is_underline:

para_prec_underline_ind
0.15
para_foll_underline_ind
0.13
target
0.06
is_bold
0.05
lang_ls_qm
0.01
para_foll_depth_ind
0.01
para_prec_depth_ind
0.01
is_italic
0.01
para_foll_size_ind
0.01
lang_ls_alnum
0.01
form_font_colour_mode_ind
0.01
para_prec_font_ind
0.01
para_foll_font_ind
0.01
para_prec_size_ind
0.01

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
is_underline
CORRELATION RATIO WITH...

lang_pct_proper_noun_singular
0.04
target_encoded
0.04
form_rel_font_size
0.03
lang_pct_verb_3rd_person_sing_present
0.03
lang_pct_verb_base_form
0.03
lang_pct_determiner
0.02
lang_pct_verb_sing_present_non_third_person
0.02
lang_mean_words_per_sent
0.02
lang_pct_personal_pronoun
0.02
lang_pct_to_infinitive_preposition
0.02
lang_pct_wh_abverb
0.02
lang_pct_possessive_pronoun
0.02
lang_pct_wh_pronoun
0.02
lang_pct_preposition_subordinating_conjunction
0.02
target
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
discarded
63,579
64%
0.00
question
31,798
32%
1.00
section
2,526
3%
2.00
subsection
2,097
2%
3.00
ALL
100,000
100%
0.43
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
target
PROVIDES INFORMATION ON...

lang_ls_qm
0.26
lang_ls_alnum
0.12
style_q
0.12
is_title
0.11
is_upper
0.08
style_heading
0.08
style_toc
0.08
begins_with
0.06
is_underline
0.06
style_box
0.04
style_list_num
0.03
lang_ls_fs
0.03
is_bold
0.03
form_font_colour_mode_ind
0.03

THESE FEATURES
GIVE INFORMATION
ON target:

lang_ls_qm
0.12
lang_ls_alnum
0.10
is_title
0.08
begins_with
0.05
is_upper
0.02
style_heading
0.02
lang_ls_fs
0.02
style_list_num
0.02
is_bold
0.02
form_font_family_mode_ind
0.02
style_q
0.01
para_prec_depth_ind
0.01
para_foll_depth_ind
0.01
para_prec_size_ind
0.01

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
target
CORRELATION RATIO WITH...

target_encoded
1.00
lang_mean_words_per_sent
0.23
lang_pct_verb_base_form
0.23
lang_pct_possessive_pronoun
0.22
lang_pct_proper_noun_singular
0.22
lang_pct_wh_pronoun
0.22
lang_pct_wh_abverb
0.20
lang_pct_verb_3rd_person_sing_present
0.20
lang_pct_preposition_subordinating_conjunction
0.20
lang_pct_verb_sing_present_non_third_person
0.18
form_rel_font_size
0.17
lang_pct_personal_pronoun
0.15
lang_pct_cardinal_digit
0.14
lang_pct_to_infinitive_preposition
0.13
form_rel_depth
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_num_sents
0.32
lang_num_words
0.31
lang_mean_words_per_sent
0.04
target_encoded
-0.03
customer_pk
0.03
lang_pct_determiner
0.01
lang_pct_proper_noun_singular
-0.01
lang_pct_possessive_pronoun
-0.01
lang_pct_modal
0.01
lang_pct_verb_past_tense
0.01
lang_pct_adverb
0.01
lang_pct_noun_plural
-0.00
lang_pct_wh_pronoun
-0.00
lang_pct_foreign_word
-0.00

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

para_prec_depth_ind
0.13
para_foll_depth_ind
0.10
begins_with
0.03
target
0.03
style_list_num
0.02
style_toc
0.02
para_prec_colour_ind
0.02
style_heading
0.02
lang_ls_clscl
0.02
lang_ls_brkt
0.01
style_table
0.01
para_foll_colour_ind
0.01
para_prec_size_ind
0.01
is_underline
0.01
MOST FREQUENT VALUES

4
36,848
36.8%
1
30,364
30.4%
2
22,659
22.7%
5
5,305
5.3%
7
2,061
2.1%
3
679
0.7%
6
372
0.4%
8
199
0.2%
10
84
<0.1%
9
68
<0.1%
12
57
<0.1%
11
56
<0.1%
13
45
<0.1%
20
43
<0.1%
15
38
<0.1%
SMALLEST VALUES

1
30,364
30.4%
2
22,659
22.7%
3
679
0.7%
4
36,848
36.8%
5
5,305
5.3%
6
372
0.4%
7
2,061
2.1%
8
199
0.2%
9
68
<0.1%
10
84
<0.1%
11
56
<0.1%
12
57
<0.1%
13
45
<0.1%
14
38
<0.1%
15
38
<0.1%
LARGEST VALUES

615
1
<0.1%
613
1
<0.1%
604
1
<0.1%
571
1
<0.1%
554
1
<0.1%
539
1
<0.1%
471
1
<0.1%
456
1
<0.1%
455
1
<0.1%
452
1
<0.1%
447
1
<0.1%
430
1
<0.1%
393
1
<0.1%
383
1
<0.1%
361
1
<0.1%
form_rel_font_size
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

target_encoded
-0.11
lang_pct_proper_noun_singular
0.08
lang_pct_wh_pronoun
-0.05
lang_pct_verb_base_form
-0.04
lang_pct_possessive_pronoun
-0.04
lang_pct_wh_abverb
-0.04
lang_pct_verb_3rd_person_sing_present
-0.04
lang_pct_verb_sing_present_non_third_person
-0.04
lang_pct_cardinal_digit
0.04
lang_pct_determiner
-0.03
lang_pct_personal_pronoun
-0.03
lang_num_words
0.03
lang_pct_to_infinitive_preposition
-0.03
lang_pct_punct
-0.02

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

para_foll_size_ind
0.28
para_prec_size_ind
0.24
target
0.17
form_font_colour_mode_ind
0.11
form_font_family_mode_ind
0.11
lang_ls_qm
0.10
para_prec_colour_ind
0.08
lang_ls_alnum
0.07
para_prec_font_ind
0.07
para_prec_depth_ind
0.06
para_foll_depth_ind
0.06
style_heading
0.06
style_table
0.06
is_bold
0.05
MOST FREQUENT VALUES

1
38,953
39.0%
2
15,257
15.3%
3
12,973
13.0%
5
9,774
9.8%
4
8,678
8.7%
7
5,180
5.2%
6
3,927
3.9%
8
1,414
1.4%
9
1,373
1.4%
10
579
0.6%
11
472
0.5%
14
359
0.4%
12
335
0.3%
13
270
0.3%
16
78
<0.1%
SMALLEST VALUES

1
38,953
39.0%
2
15,257
15.3%
3
12,973
13.0%
4
8,678
8.7%
5
9,774
9.8%
6
3,927
3.9%
7
5,180
5.2%
8
1,414
1.4%
9
1,373
1.4%
10
579
0.6%
11
472
0.5%
12
335
0.3%
13
270
0.3%
14
359
0.4%
15
64
<0.1%
LARGEST VALUES

74
1
<0.1%
73
1
<0.1%
54
1
<0.1%
51
2
<0.1%
49
4
<0.1%
46
1
<0.1%
44
2
<0.1%
40
1
<0.1%
39
2
<0.1%
38
2
<0.1%
37
1
<0.1%
36
4
<0.1%
35
4
<0.1%
34
5
<0.1%
33
1
<0.1%
form_font_family_mode_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
1
75,519
76%
0.468
0
24,481
24%
0.320
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
form_font_family_mode_ind
PROVIDES INFORMATION ON...

para_foll_font_ind
0.06
para_prec_font_ind
0.06
style_box
0.04
style_cover_nm_add
0.03
style_q
0.02
style_table
0.02
target
0.02
lang_ls_qm
0.01
para_foll_depth_ind
0.01
para_foll_size_ind
0.01
para_prec_depth_ind
0.01
style_bullet
0.01
para_prec_size_ind
0.01
form_font_colour_mode_ind
0.01

THESE FEATURES
GIVE INFORMATION
ON form_font_family_mode_ind:

para_foll_font_ind
0.04
para_prec_font_ind
0.04
target
0.02
para_foll_depth_ind
0.01
para_prec_depth_ind
0.01
style_table
0.01
lang_ls_qm
0.01
para_foll_size_ind
0.01
para_prec_size_ind
0.01
style_q
0.00
form_font_colour_mode_ind
0.00
begins_with
0.00
lang_ls_alnum
0.00
style_heading
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
form_font_family_mode_ind
CORRELATION RATIO WITH...

form_rel_font_size
0.11
target_encoded
0.10
lang_pct_proper_noun_singular
0.06
lang_pct_wh_pronoun
0.05
lang_pct_possessive_pronoun
0.04
lang_pct_cardinal_digit
0.04
lang_pct_wh_abverb
0.04
lang_pct_verb_3rd_person_sing_present
0.03
lang_pct_determiner
0.03
lang_pct_verb_sing_present_non_third_person
0.03
lang_pct_personal_pronoun
0.03
lang_pct_verb_base_form
0.03
lang_pct_noun_singular
0.02
lang_pct_interjection
0.02
form_font_colour_mode_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
1
91,567
92%
0.438
0
8,433
8%
0.358
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
form_font_colour_mode_ind
PROVIDES INFORMATION ON...

para_foll_colour_ind
0.18
para_prec_colour_ind
0.17
style_bullet
0.03
para_foll_size_ind
0.02
is_italic
0.01
para_prec_size_ind
0.01
is_bold
0.01
target
0.01
style_box
0.01
is_underline
0.01
lang_ls_qm
0.01
lang_ls_alnum
0.01
style_heading
0.00
para_prec_italic_ind
0.00

THESE FEATURES
GIVE INFORMATION
ON form_font_colour_mode_ind:

para_foll_colour_ind
0.16
para_prec_colour_ind
0.16
target
0.03
para_foll_size_ind
0.03
is_bold
0.02
para_prec_size_ind
0.02
lang_ls_alnum
0.01
lang_ls_qm
0.01
style_bullet
0.01
is_title
0.01
para_foll_bold_ind
0.01
para_prec_depth_ind
0.01
form_font_family_mode_ind
0.01
is_italic
0.01

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
form_font_colour_mode_ind
CORRELATION RATIO WITH...

form_rel_font_size
0.11
lang_mean_words_per_sent
0.07
lang_pct_proper_noun_singular
0.06
lang_pct_preposition_subordinating_conjunction
0.06
lang_pct_determiner
0.05
lang_pct_verb_base_form
0.05
lang_pct_verb_3rd_person_sing_present
0.04
lang_pct_modal
0.04
lang_pct_noun_singular
0.04
lang_pct_to_infinitive_preposition
0.03
target_encoded
0.03
lang_pct_wh_pronoun
0.03
lang_pct_possessive_pronoun
0.03
lang_pct_wh_abverb
0.03
lang_num_sents
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_num_words
0.96
form_rel_depth
0.32
lang_mean_words_per_sent
0.09
lang_pct_preposition_subordinating_conjunction
0.04
customer_pk
0.03
lang_pct_verb_base_form
0.03
lang_pct_modal
0.03
lang_pct_to_infinitive_preposition
0.02
lang_pct_noun_singular
-0.02
form_rel_font_size
0.02
lang_pct_determiner
0.02
lang_pct_proper_noun_singular
-0.02
lang_pct_verb_3rd_person_sing_present
0.02
lang_pct_verb_sing_present_non_third_person
0.02

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

para_foll_depth_ind
0.15
para_prec_depth_ind
0.12
lang_ls_fs
0.07
lang_ls_alnum
0.05
is_title
0.04
is_upper
0.02
style_table
0.02
para_foll_bold_ind
0.02
begins_with
0.02
target
0.02
para_prec_underline_ind
0.01
style_list_num
0.01
style_ans
0.01
lang_ls_clscl
0.01
MOST FREQUENT VALUES

1
83,768
83.8%
2
10,591
10.6%
3
3,246
3.2%
4
1,113
1.1%
5
435
0.4%
6
182
0.2%
7
93
<0.1%
8
60
<0.1%
9
39
<0.1%
10
35
<0.1%
12
18
<0.1%
11
17
<0.1%
13
17
<0.1%
20
12
<0.1%
14
10
<0.1%
SMALLEST VALUES

1
83,768
83.8%
2
10,591
10.6%
3
3,246
3.2%
4
1,113
1.1%
5
435
0.4%
6
182
0.2%
7
93
<0.1%
8
60
<0.1%
9
39
<0.1%
10
35
<0.1%
11
17
<0.1%
12
18
<0.1%
13
17
<0.1%
14
10
<0.1%
15
4
<0.1%
LARGEST VALUES

400
1
<0.1%
397
1
<0.1%
396
2
<0.1%
393
2
<0.1%
390
1
<0.1%
386
1
<0.1%
334
1
<0.1%
322
1
<0.1%
313
2
<0.1%
312
1
<0.1%
310
1
<0.1%
307
1
<0.1%
297
2
<0.1%
291
1
<0.1%
280
1
<0.1%
lang_num_words
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_num_sents
0.96
form_rel_depth
0.31
lang_mean_words_per_sent
0.16
lang_pct_preposition_subordinating_conjunction
0.05
lang_pct_to_infinitive_preposition
0.04
lang_pct_modal
0.03
lang_pct_verb_base_form
0.03
lang_pct_noun_singular
-0.03
customer_pk
0.03
lang_pct_determiner
0.03
form_rel_font_size
0.03
lang_pct_coordinating_conjunction
0.03
lang_pct_proper_noun_singular
-0.03
lang_pct_wh_determiner
0.02

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

para_foll_depth_ind
0.15
para_prec_depth_ind
0.12
lang_ls_fs
0.09
lang_ls_alnum
0.06
is_title
0.06
is_upper
0.03
style_table
0.02
para_foll_bold_ind
0.02
begins_with
0.02
para_prec_underline_ind
0.02
target
0.02
para_prec_bold_ind
0.01
style_ans
0.01
lang_ls_clscl
0.01
MOST FREQUENT VALUES

1
12,147
12.1%
2
11,660
11.7%
3
8,037
8.0%
4
5,903
5.9%
5
4,329
4.3%
6
3,769
3.8%
7
3,353
3.4%
8
3,183
3.2%
9
2,699
2.7%
10
2,603
2.6%
11
2,436
2.4%
12
2,332
2.3%
13
2,256
2.3%
14
2,086
2.1%
15
2,004
2.0%
SMALLEST VALUES

1
12,147
12.1%
2
11,660
11.7%
3
8,037
8.0%
4
5,903
5.9%
5
4,329
4.3%
6
3,769
3.8%
7
3,353
3.4%
8
3,183
3.2%
9
2,699
2.7%
10
2,603
2.6%
11
2,436
2.4%
12
2,332
2.3%
13
2,256
2.3%
14
2,086
2.1%
15
2,004
2.0%
LARGEST VALUES

14924
1
<0.1%
13727
1
<0.1%
12164
1
<0.1%
12015
1
<0.1%
12008
1
<0.1%
11774
1
<0.1%
11487
2
<0.1%
11434
1
<0.1%
9163
1
<0.1%
8925
1
<0.1%
8910
1
<0.1%
8796
1
<0.1%
8401
1
<0.1%
8396
1
<0.1%
8358
1
<0.1%
lang_mean_words_per_sent
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_preposition_subordinating_conjunction
0.34
lang_pct_to_infinitive_preposition
0.24
lang_pct_noun_singular
-0.22
lang_pct_verb_base_form
0.21
lang_pct_coordinating_conjunction
0.21
lang_pct_modal
0.18
lang_pct_wh_determiner
0.18
lang_pct_determiner
0.17
lang_pct_proper_noun_singular
-0.16
lang_num_words
0.16
lang_pct_verb_3rd_person_sing_present
0.15
lang_pct_verb_sing_present_non_third_person
0.12
lang_pct_verb_past_participle
0.11
lang_pct_possessive_pronoun
0.11

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_alnum
0.48
lang_ls_fs
0.47
is_title
0.42
target
0.23
is_upper
0.19
lang_ls_qm
0.12
is_bold
0.12
style_list_num
0.10
para_foll_depth_ind
0.10
style_table
0.09
style_heading
0.08
form_font_colour_mode_ind
0.07
begins_with
0.07
style_toc
0.06
MOST FREQUENT VALUES

1.0
12,147
12.1%
2.0
11,726
11.7%
3.0
8,083
8.1%
4.0
5,915
5.9%
5.0
4,386
4.4%
6.0
3,863
3.9%
7.0
3,553
3.6%
8.0
3,380
3.4%
9.0
2,950
3.0%
10.0
2,852
2.9%
11.0
2,651
2.7%
12.0
2,553
2.6%
13.0
2,435
2.4%
14.0
2,201
2.2%
15.0
2,099
2.1%
SMALLEST VALUES

1.0
12,147
12.1%
1.03125
1
<0.1%
1.5
33
<0.1%
1.6666666666666667
3
<0.1%
1.75
1
<0.1%
2.0
11,726
11.7%
2.3333333333333335
1
<0.1%
2.5
56
<0.1%
2.6666666666666665
4
<0.1%
3.0
8,083
8.1%
3.25
1
<0.1%
3.3333333333333335
5
<0.1%
3.375
1
<0.1%
3.5
70
<0.1%
3.6666666666666665
5
<0.1%
LARGEST VALUES

650.0
1
<0.1%
303.0
1
<0.1%
284.0
1
<0.1%
270.0
1
<0.1%
224.0
1
<0.1%
217.5
1
<0.1%
213.0
1
<0.1%
209.0
1
<0.1%
203.0
2
<0.1%
202.0
1
<0.1%
189.0
1
<0.1%
188.0
1
<0.1%
170.2
1
<0.1%
170.0
2
<0.1%
169.0
1
<0.1%
lang_ls_alnum
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
52,925
53%
0.520
1
47,075
47%
0.332
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
lang_ls_alnum
PROVIDES INFORMATION ON...

lang_ls_fs
0.35
lang_ls_qm
0.23
lang_ls_clscl
0.19
lang_ls_brkt
0.18
is_title
0.17
style_toc
0.13
is_upper
0.11
target
0.10
style_heading
0.03
style_q
0.03
style_title
0.02
style_table
0.02
style_list_num
0.02
form_font_colour_mode_ind
0.01

THESE FEATURES
GIVE INFORMATION
ON lang_ls_alnum:

lang_ls_fs
0.28
is_title
0.14
lang_ls_qm
0.13
target
0.12
lang_ls_clscl
0.07
lang_ls_brkt
0.06
is_upper
0.04
style_toc
0.02
style_list_num
0.01
style_heading
0.01
begins_with
0.01
is_bold
0.01
style_table
0.01
form_font_colour_mode_ind
0.01

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
lang_ls_alnum
CORRELATION RATIO WITH...

lang_pct_punct
0.53
lang_mean_words_per_sent
0.48
lang_pct_proper_noun_singular
0.26
lang_pct_verb_base_form
0.26
lang_pct_preposition_subordinating_conjunction
0.25
lang_pct_noun_singular
0.24
lang_pct_verb_3rd_person_sing_present
0.22
lang_pct_possessive_pronoun
0.19
lang_pct_to_infinitive_preposition
0.18
lang_pct_verb_sing_present_non_third_person
0.18
lang_pct_modal
0.17
lang_pct_cardinal_digit
0.16
lang_pct_wh_pronoun
0.16
lang_pct_wh_abverb
0.15
lang_ls_qm
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
87,480
87%
0.367
1
12,520
13%
0.882
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
lang_ls_qm
PROVIDES INFORMATION ON...

lang_ls_alnum
0.13
target
0.12
lang_ls_fs
0.07
is_title
0.06
lang_ls_clscl
0.04
lang_ls_brkt
0.04
is_upper
0.03
style_toc
0.02
begins_with
0.02
style_q
0.02
is_underline
0.01
style_list_num
0.01
form_font_colour_mode_ind
0.01
form_font_family_mode_ind
0.01

THESE FEATURES
GIVE INFORMATION
ON lang_ls_qm:

target
0.26
lang_ls_alnum
0.23
lang_ls_fs
0.10
is_title
0.09
begins_with
0.03
lang_ls_clscl
0.03
lang_ls_brkt
0.02
is_upper
0.02
style_list_num
0.01
form_font_family_mode_ind
0.01
is_bold
0.01
para_foll_depth_ind
0.01
form_font_colour_mode_ind
0.01
para_prec_depth_ind
0.01

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
lang_ls_qm
CORRELATION RATIO WITH...

lang_pct_wh_pronoun
0.37
lang_pct_verb_3rd_person_sing_present
0.28
lang_pct_wh_abverb
0.27
target_encoded
0.26
lang_pct_verb_sing_present_non_third_person
0.23
lang_pct_personal_pronoun
0.21
lang_pct_possessive_pronoun
0.17
lang_pct_proper_noun_singular
0.16
lang_pct_existential_there
0.14
lang_pct_verb_base_form
0.14
lang_mean_words_per_sent
0.12
form_rel_font_size
0.10
lang_pct_noun_singular
0.09
lang_pct_verb_past_participle
0.09
lang_ls_fs
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
75,071
75%
0.423
1
24,929
25%
0.457
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
lang_ls_fs
PROVIDES INFORMATION ON...

lang_ls_alnum
0.28
is_title
0.12
lang_ls_qm
0.10
lang_ls_clscl
0.09
lang_ls_brkt
0.08
is_upper
0.06
style_toc
0.05
target
0.02
para_foll_underline_ind
0.02
style_q
0.01
style_heading
0.01
para_prec_underline_ind
0.01
is_bold
0.01
style_title
0.01

THESE FEATURES
GIVE INFORMATION
ON lang_ls_fs:

lang_ls_alnum
0.35
is_title
0.12
lang_ls_qm
0.07
lang_ls_clscl
0.04
lang_ls_brkt
0.03
target
0.03
is_upper
0.03
is_bold
0.01
style_toc
0.01
style_heading
0.01
style_list_num
0.00
style_table
0.00
begins_with
0.00
style_q
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
lang_ls_fs
CORRELATION RATIO WITH...

lang_mean_words_per_sent
0.47
lang_pct_preposition_subordinating_conjunction
0.25
lang_pct_verb_base_form
0.22
lang_pct_to_infinitive_preposition
0.19
lang_pct_modal
0.17
lang_pct_proper_noun_singular
0.17
lang_pct_determiner
0.15
lang_pct_noun_singular
0.15
lang_pct_wh_determiner
0.13
lang_pct_coordinating_conjunction
0.13
lang_pct_possessive_pronoun
0.12
lang_pct_punct
0.12
lang_pct_cardinal_digit
0.11
lang_pct_verb_past_participle
0.09
lang_ls_clscl
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
92,323
92%
0.437
1
7,677
8%
0.361
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
lang_ls_clscl
PROVIDES INFORMATION ON...

lang_ls_alnum
0.07
lang_ls_fs
0.04
lang_ls_qm
0.03
lang_ls_brkt
0.02
style_toc
0.01
para_foll_depth_ind
0.01
is_underline
0.00
is_upper
0.00
style_heading
0.00
para_foll_font_ind
0.00
para_prec_depth_ind
0.00
para_prec_underline_ind
0.00
target
0.00
style_table
0.00

THESE FEATURES
GIVE INFORMATION
ON lang_ls_clscl:

lang_ls_alnum
0.19
lang_ls_fs
0.09
lang_ls_qm
0.04
para_foll_depth_ind
0.02
lang_ls_brkt
0.02
target
0.00
style_toc
0.00
para_prec_depth_ind
0.00
para_foll_font_ind
0.00
is_upper
0.00
style_heading
0.00
form_font_family_mode_ind
0.00
is_underline
0.00
para_prec_bold_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
lang_ls_clscl
CORRELATION RATIO WITH...

lang_pct_punct
0.35
lang_pct_proper_noun_singular
0.06
lang_pct_cardinal_digit
0.05
lang_pct_wh_pronoun
0.04
lang_pct_coordinating_conjunction
0.04
target_encoded
0.03
lang_pct_noun_singular
0.03
lang_pct_interjection
0.03
lang_pct_wh_abverb
0.02
lang_pct_adverb
0.02
lang_pct_personal_pronoun
0.02
lang_pct_preposition_subordinating_conjunction
0.02
form_rel_depth
0.02
lang_pct_possessive_pronoun
0.02
lang_ls_brkt
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
93,818
94%
0.439
1
6,182
6%
0.315
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
lang_ls_brkt
PROVIDES INFORMATION ON...

lang_ls_alnum
0.06
lang_ls_fs
0.03
lang_ls_qm
0.02
lang_ls_clscl
0.02
style_toc
0.01
is_title
0.00
begins_with
0.00
is_bold
0.00
para_prec_underline_ind
0.00
is_upper
0.00
style_q
0.00
is_underline
0.00
para_foll_bold_ind
0.00
para_prec_bold_ind
0.00

THESE FEATURES
GIVE INFORMATION
ON lang_ls_brkt:

lang_ls_alnum
0.18
lang_ls_fs
0.08
lang_ls_qm
0.04
lang_ls_clscl
0.02
begins_with
0.01
is_title
0.01
is_bold
0.01
target
0.01
para_foll_bold_ind
0.00
style_toc
0.00
para_prec_bold_ind
0.00
is_upper
0.00
para_foll_depth_ind
0.00
style_heading
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
lang_ls_brkt
CORRELATION RATIO WITH...

lang_pct_punct
0.38
lang_pct_noun_singular
0.09
lang_pct_preposition_subordinating_conjunction
0.05
lang_pct_noun_plural
0.05
lang_pct_verb_3rd_person_sing_present
0.05
target_encoded
0.05
lang_pct_proper_noun_singular
0.05
lang_pct_coordinating_conjunction
0.05
lang_pct_possessive_pronoun
0.04
lang_pct_modal
0.04
lang_pct_to_infinitive_preposition
0.04
lang_pct_determiner
0.04
lang_pct_verb_sing_present_non_third_person
0.03
lang_pct_verb_base_form
0.03
para_prec_depth_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
83,900
84%
0.417
1
8,556
9%
0.478
-1
7,544
8%
0.534
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_prec_depth_ind
PROVIDES INFORMATION ON...

para_prec_font_ind
0.17
para_foll_depth_ind
0.15
para_prec_size_ind
0.10
style_heading
0.04
para_prec_colour_ind
0.04
para_foll_font_ind
0.03
para_foll_size_ind
0.02
begins_with
0.02
style_cover_nm_add
0.02
style_list_num
0.02
target
0.01
style_toc
0.01
is_underline
0.01
para_foll_colour_ind
0.01

THESE FEATURES
GIVE INFORMATION
ON para_prec_depth_ind:

para_foll_depth_ind
0.15
para_prec_font_ind
0.12
para_prec_size_ind
0.08
begins_with
0.02
para_foll_font_ind
0.02
target
0.02
para_foll_size_ind
0.02
para_prec_colour_ind
0.02
style_heading
0.02
style_list_num
0.01
form_font_family_mode_ind
0.01
is_bold
0.01
para_foll_colour_ind
0.00
style_table
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_prec_depth_ind
CORRELATION RATIO WITH...

form_rel_depth
0.13
lang_num_words
0.12
lang_num_sents
0.12
form_rel_font_size
0.06
target_encoded
0.05
lang_pct_cardinal_digit
0.05
lang_mean_words_per_sent
0.04
lang_pct_wh_abverb
0.03
lang_pct_noun_singular
0.03
lang_pct_wh_pronoun
0.03
lang_pct_modal
0.03
lang_pct_proper_noun_singular
0.03
lang_pct_coordinating_conjunction
0.03
lang_pct_determiner
0.02
para_foll_depth_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
83,597
84%
0.419
-1
8,519
9%
0.494
1
7,884
8%
0.494
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_foll_depth_ind
PROVIDES INFORMATION ON...

para_foll_font_ind
0.17
para_prec_depth_ind
0.15
para_foll_size_ind
0.10
para_foll_colour_ind
0.04
para_prec_font_ind
0.03
style_heading
0.03
lang_ls_clscl
0.02
para_prec_size_ind
0.02
begins_with
0.02
style_list_num
0.02
para_foll_underline_ind
0.02
style_toc
0.01
is_underline
0.01
target
0.01

THESE FEATURES
GIVE INFORMATION
ON para_foll_depth_ind:

para_prec_depth_ind
0.15
para_foll_font_ind
0.12
para_foll_size_ind
0.08
begins_with
0.02
para_prec_font_ind
0.02
para_foll_colour_ind
0.02
para_prec_size_ind
0.02
target
0.02
style_list_num
0.01
style_heading
0.01
lang_ls_clscl
0.01
form_font_family_mode_ind
0.01
para_foll_bold_ind
0.01
style_table
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_foll_depth_ind
CORRELATION RATIO WITH...

lang_num_words
0.15
lang_num_sents
0.15
form_rel_depth
0.10
lang_mean_words_per_sent
0.10
form_rel_font_size
0.06
lang_pct_noun_singular
0.05
lang_pct_modal
0.05
target_encoded
0.04
lang_pct_determiner
0.04
lang_pct_cardinal_digit
0.04
lang_pct_preposition_subordinating_conjunction
0.04
lang_pct_verb_base_form
0.03
lang_pct_wh_pronoun
0.03
lang_pct_wh_abverb
0.03
para_prec_size_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
87,996
88%
0.425
-1
6,238
6%
0.377
1
5,766
6%
0.595
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_prec_size_ind
PROVIDES INFORMATION ON...

para_foll_size_ind
0.22
para_prec_depth_ind
0.08
para_prec_font_ind
0.08
para_prec_colour_ind
0.08
style_heading
0.03
para_foll_colour_ind
0.02
form_font_colour_mode_ind
0.02
para_foll_font_ind
0.02
para_foll_depth_ind
0.02
para_prec_bold_ind
0.01
target
0.01
is_bold
0.01
para_prec_italic_ind
0.01
style_title
0.01

THESE FEATURES
GIVE INFORMATION
ON para_prec_size_ind:

para_foll_size_ind
0.22
para_prec_depth_ind
0.10
para_prec_font_ind
0.07
para_prec_colour_ind
0.05
para_foll_depth_ind
0.02
target
0.02
para_foll_font_ind
0.02
style_heading
0.02
para_foll_colour_ind
0.01
form_font_colour_mode_ind
0.01
para_prec_bold_ind
0.01
is_bold
0.01
form_font_family_mode_ind
0.01
style_list_num
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_prec_size_ind
CORRELATION RATIO WITH...

form_rel_font_size
0.24
target_encoded
0.06
lang_pct_proper_noun_singular
0.06
lang_mean_words_per_sent
0.05
lang_pct_determiner
0.03
lang_pct_preposition_subordinating_conjunction
0.03
lang_pct_wh_abverb
0.03
lang_pct_verb_3rd_person_sing_present
0.03
lang_pct_verb_base_form
0.03
lang_pct_to_infinitive_preposition
0.03
lang_pct_punct
0.03
lang_pct_verb_sing_present_non_third_person
0.02
lang_pct_wh_pronoun
0.02
lang_pct_noun_plural
0.02
para_foll_size_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
87,910
88%
0.427
1
6,259
6%
0.569
-1
5,831
6%
0.345
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_foll_size_ind
PROVIDES INFORMATION ON...

para_prec_size_ind
0.22
para_foll_depth_ind
0.08
para_foll_font_ind
0.08
para_foll_colour_ind
0.07
para_prec_colour_ind
0.03
style_heading
0.03
form_font_colour_mode_ind
0.03
style_title
0.02
para_prec_font_ind
0.02
para_prec_depth_ind
0.02
is_bold
0.01
para_foll_bold_ind
0.01
target
0.01
para_foll_underline_ind
0.01

THESE FEATURES
GIVE INFORMATION
ON para_foll_size_ind:

para_prec_size_ind
0.22
para_foll_depth_ind
0.10
para_foll_font_ind
0.07
para_foll_colour_ind
0.04
para_prec_depth_ind
0.02
para_prec_colour_ind
0.02
para_prec_font_ind
0.02
target
0.02
form_font_colour_mode_ind
0.02
style_heading
0.02
is_bold
0.01
para_foll_bold_ind
0.01
form_font_family_mode_ind
0.01
lang_ls_alnum
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_foll_size_ind
CORRELATION RATIO WITH...

form_rel_font_size
0.28
lang_pct_proper_noun_singular
0.06
target_encoded
0.06
lang_mean_words_per_sent
0.05
lang_pct_punct
0.03
lang_pct_verb_base_form
0.03
lang_pct_verb_3rd_person_sing_present
0.03
lang_pct_wh_pronoun
0.03
lang_pct_wh_abverb
0.03
lang_pct_determiner
0.03
lang_pct_preposition_subordinating_conjunction
0.03
lang_pct_to_infinitive_preposition
0.02
lang_pct_noun_plural
0.02
lang_pct_verb_sing_present_non_third_person
0.02
para_prec_bold_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0.0
80,945
81%
0.438
1.0
19,055
19%
0.403
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_prec_bold_ind
PROVIDES INFORMATION ON...

para_foll_bold_ind
0.21
is_bold
0.20
para_prec_underline_ind
0.06
style_toc
0.02
style_box
0.02
para_prec_italic_ind
0.01
para_prec_size_ind
0.01
style_cover_nm_add
0.01
para_prec_colour_ind
0.01
form_font_colour_mode_ind
0.00
para_foll_underline_ind
0.00
is_italic
0.00
lang_ls_brkt
0.00
style_bullet
0.00

THESE FEATURES
GIVE INFORMATION
ON para_prec_bold_ind:

para_foll_bold_ind
0.21
is_bold
0.20
para_prec_size_ind
0.01
para_prec_underline_ind
0.01
style_toc
0.00
para_prec_italic_ind
0.00
para_prec_colour_ind
0.00
para_prec_depth_ind
0.00
form_font_colour_mode_ind
0.00
style_list_num
0.00
lang_ls_brkt
0.00
lang_ls_qm
0.00
style_heading
0.00
para_foll_depth_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_prec_bold_ind
CORRELATION RATIO WITH...

lang_pct_punct
0.03
lang_pct_cardinal_digit
0.02
lang_pct_noun_singular
0.02
target_encoded
0.02
lang_pct_coordinating_conjunction
0.02
lang_pct_verb_base_form
0.02
lang_pct_verb_3rd_person_sing_present
0.02
lang_pct_verb_sing_present_non_third_person
0.02
lang_pct_adjective
0.01
lang_pct_verb_gerund_present_participle
0.01
lang_pct_wh_pronoun
0.01
lang_pct_wh_abverb
0.01
lang_pct_adverb
0.01
lang_num_words
0.01
para_foll_bold_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0.0
81,153
81%
0.442
1.0
18,847
19%
0.387
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_foll_bold_ind
PROVIDES INFORMATION ON...

is_bold
0.21
para_prec_bold_ind
0.21
para_foll_underline_ind
0.05
style_toc
0.02
para_foll_italic_ind
0.01
style_box
0.01
style_cover_nm_add
0.01
para_foll_size_ind
0.01
para_foll_depth_ind
0.01
para_foll_colour_ind
0.01
form_font_colour_mode_ind
0.01
is_italic
0.01
para_prec_colour_ind
0.00
para_prec_italic_ind
0.00

THESE FEATURES
GIVE INFORMATION
ON para_foll_bold_ind:

is_bold
0.21
para_prec_bold_ind
0.21
para_foll_size_ind
0.01
para_foll_underline_ind
0.01
para_foll_depth_ind
0.01
form_font_colour_mode_ind
0.00
para_foll_colour_ind
0.00
target
0.00
style_toc
0.00
para_foll_italic_ind
0.00
para_prec_colour_ind
0.00
para_prec_size_ind
0.00
lang_ls_brkt
0.00
lang_ls_qm
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_foll_bold_ind
CORRELATION RATIO WITH...

lang_pct_punct
0.04
target_encoded
0.03
form_rel_font_size
0.03
lang_pct_noun_plural
0.03
lang_pct_verb_3rd_person_sing_present
0.02
lang_pct_coordinating_conjunction
0.02
lang_pct_cardinal_digit
0.02
lang_num_words
0.02
lang_num_sents
0.02
lang_pct_noun_singular
0.02
lang_pct_possessive_pronoun
0.02
lang_pct_wh_pronoun
0.02
lang_pct_proper_noun_singular
0.02
lang_pct_verb_base_form
0.01
para_prec_italic_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0.0
97,763
98%
0.430
1.0
2,237
2%
0.489
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_prec_italic_ind
PROVIDES INFORMATION ON...

para_foll_italic_ind
0.22
is_italic
0.20
para_prec_underline_ind
0.01
style_box
0.00
para_prec_bold_ind
0.00
para_prec_colour_ind
0.00
style_cover_nm_add
0.00
para_foll_underline_ind
0.00
style_toc
0.00
para_prec_size_ind
0.00
style_ans
0.00
style_q
0.00
form_font_colour_mode_ind
0.00
is_bold
0.00

THESE FEATURES
GIVE INFORMATION
ON para_prec_italic_ind:

para_foll_italic_ind
0.22
is_italic
0.20
para_prec_bold_ind
0.01
para_prec_underline_ind
0.01
para_prec_size_ind
0.01
para_prec_colour_ind
0.01
is_bold
0.01
para_prec_depth_ind
0.00
para_foll_bold_ind
0.00
para_foll_depth_ind
0.00
form_font_colour_mode_ind
0.00
para_foll_size_ind
0.00
para_foll_font_ind
0.00
para_prec_font_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_prec_italic_ind
CORRELATION RATIO WITH...

target_encoded
0.01
lang_pct_preposition_subordinating_conjunction
0.01
lang_pct_punct
0.01
lang_mean_words_per_sent
0.01
lang_pct_cardinal_digit
0.01
form_rel_font_size
0.01
lang_pct_wh_pronoun
0.01
lang_pct_particle
0.01
lang_pct_verb_base_form
0.01
lang_pct_noun_singular
0.00
lang_pct_proper_noun_singular
0.00
lang_pct_possessive_pronoun
0.00
lang_pct_possessive_wh_pronoun
0.00
lang_pct_to_infinitive_preposition
0.00
para_foll_italic_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0.0
97,808
98%
0.431
1.0
2,192
2%
0.438
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_foll_italic_ind
PROVIDES INFORMATION ON...

para_prec_italic_ind
0.22
is_italic
0.20
para_foll_underline_ind
0.01
para_foll_bold_ind
0.00
style_box
0.00
style_toc
0.00
style_cover_nm_add
0.00
para_foll_colour_ind
0.00
style_q
0.00
para_foll_size_ind
0.00
para_prec_colour_ind
0.00
para_prec_underline_ind
0.00
form_font_colour_mode_ind
0.00
is_bold
0.00

THESE FEATURES
GIVE INFORMATION
ON para_foll_italic_ind:

para_prec_italic_ind
0.22
is_italic
0.21
para_foll_bold_ind
0.01
para_foll_underline_ind
0.01
para_foll_size_ind
0.01
is_bold
0.00
para_foll_colour_ind
0.00
form_font_colour_mode_ind
0.00
para_prec_colour_ind
0.00
para_foll_depth_ind
0.00
para_prec_depth_ind
0.00
para_prec_size_ind
0.00
para_prec_bold_ind
0.00
para_prec_font_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_foll_italic_ind
CORRELATION RATIO WITH...

lang_pct_cardinal_digit
0.01
lang_mean_words_per_sent
0.01
form_rel_font_size
0.01
lang_pct_preposition_subordinating_conjunction
0.01
lang_pct_punct
0.01
lang_pct_noun_singular
0.01
lang_pct_verb_base_form
0.01
lang_pct_wh_abverb
0.01
lang_pct_proper_noun_plural
0.01
lang_pct_to_infinitive_preposition
0.01
lang_pct_possessive_pronoun
0.01
lang_pct_particle
0.01
lang_pct_personal_pronoun
0.01
lang_pct_wh_pronoun
0.01
para_prec_underline_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0.0
98,363
98%
0.432
1.0
1,637
2%
0.420
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_prec_underline_ind
PROVIDES INFORMATION ON...

para_foll_underline_ind
0.15
is_underline
0.15
para_prec_bold_ind
0.01
para_prec_italic_ind
0.01
lang_ls_fs
0.00
style_cover_nm_add
0.00
style_q
0.00
para_prec_depth_ind
0.00
style_toc
0.00
para_foll_italic_ind
0.00
lang_ls_brkt
0.00
style_box
0.00
para_prec_size_ind
0.00
lang_ls_alnum
0.00

THESE FEATURES
GIVE INFORMATION
ON para_prec_underline_ind:

para_foll_underline_ind
0.15
is_underline
0.14
para_prec_bold_ind
0.06
lang_ls_fs
0.01
para_prec_italic_ind
0.01
para_prec_depth_ind
0.01
lang_ls_alnum
0.01
para_prec_size_ind
0.00
para_foll_depth_ind
0.00
para_prec_font_ind
0.00
para_foll_bold_ind
0.00
is_title
0.00
lang_ls_brkt
0.00
is_bold
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_prec_underline_ind
CORRELATION RATIO WITH...

lang_mean_words_per_sent
0.05
lang_pct_modal
0.03
lang_pct_preposition_subordinating_conjunction
0.02
lang_num_words
0.02
lang_pct_determiner
0.02
lang_num_sents
0.01
form_rel_font_size
0.01
lang_pct_to_infinitive_preposition
0.01
lang_pct_verb_base_form
0.01
lang_pct_wh_determiner
0.01
lang_pct_interjection
0.01
lang_pct_coordinating_conjunction
0.01
lang_pct_cardinal_digit
0.01
lang_pct_noun_singular
0.01
para_foll_underline_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0.0
98,374
98%
0.432
1.0
1,626
2%
0.386
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_foll_underline_ind
PROVIDES INFORMATION ON...

para_prec_underline_ind
0.15
is_underline
0.13
para_foll_bold_ind
0.01
para_foll_italic_ind
0.01
para_foll_depth_ind
0.00
lang_ls_fs
0.00
style_cover_nm_add
0.00
para_prec_italic_ind
0.00
para_foll_size_ind
0.00
para_foll_font_ind
0.00
style_q
0.00
style_table
0.00
style_title
0.00
style_toc
0.00

THESE FEATURES
GIVE INFORMATION
ON para_foll_underline_ind:

para_prec_underline_ind
0.15
is_underline
0.13
para_foll_bold_ind
0.05
para_foll_depth_ind
0.02
lang_ls_fs
0.02
para_foll_size_ind
0.01
para_foll_italic_ind
0.01
para_foll_font_ind
0.01
lang_ls_alnum
0.01
style_table
0.00
para_prec_bold_ind
0.00
is_title
0.00
para_prec_italic_ind
0.00
para_foll_colour_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_foll_underline_ind
CORRELATION RATIO WITH...

lang_mean_words_per_sent
0.05
lang_pct_modal
0.02
lang_pct_preposition_subordinating_conjunction
0.02
form_rel_font_size
0.02
lang_pct_noun_singular
0.02
lang_pct_to_infinitive_preposition
0.01
lang_pct_punct
0.01
lang_pct_adverb
0.01
lang_pct_verb_base_form
0.01
lang_pct_interjection
0.01
form_rel_depth
0.01
lang_pct_possessive_wh_pronoun
0.01
lang_pct_determiner
0.01
target_encoded
0.01
para_prec_font_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
1
86,733
87%
0.427
0
13,267
13%
0.460
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_prec_font_ind
PROVIDES INFORMATION ON...

para_foll_font_ind
0.13
para_prec_depth_ind
0.12
para_prec_size_ind
0.07
para_prec_colour_ind
0.05
form_font_family_mode_ind
0.04
style_heading
0.03
para_foll_depth_ind
0.02
para_foll_size_ind
0.02
style_title
0.01
style_table
0.01
style_toc
0.01
style_bullet
0.01
para_foll_colour_ind
0.01
style_indent
0.01

THESE FEATURES
GIVE INFORMATION
ON para_prec_font_ind:

para_prec_depth_ind
0.17
para_foll_font_ind
0.13
para_prec_size_ind
0.08
form_font_family_mode_ind
0.06
para_foll_depth_ind
0.03
para_prec_colour_ind
0.03
para_foll_size_ind
0.02
style_heading
0.02
target
0.01
style_table
0.01
para_foll_colour_ind
0.01
style_toc
0.00
is_bold
0.00
form_font_colour_mode_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_prec_font_ind
CORRELATION RATIO WITH...

form_rel_font_size
0.07
lang_pct_proper_noun_singular
0.04
lang_pct_interjection
0.03
lang_pct_cardinal_digit
0.03
lang_pct_wh_pronoun
0.02
lang_pct_wh_abverb
0.02
target_encoded
0.02
lang_pct_adjective
0.02
lang_pct_punct
0.01
lang_pct_coordinating_conjunction
0.01
lang_pct_verb_3rd_person_sing_present
0.01
lang_pct_noun_singular
0.01
lang_pct_list_marker
0.01
lang_num_words
0.01
para_foll_font_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
1
86,738
87%
0.427
0
13,262
13%
0.462
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_foll_font_ind
PROVIDES INFORMATION ON...

para_prec_font_ind
0.13
para_foll_depth_ind
0.12
para_foll_size_ind
0.07
para_foll_colour_ind
0.05
form_font_family_mode_ind
0.04
style_heading
0.03
para_prec_depth_ind
0.02
para_prec_size_ind
0.02
style_toc
0.02
style_table
0.01
style_bullet
0.01
para_foll_underline_ind
0.01
para_prec_colour_ind
0.01
style_indent
0.01

THESE FEATURES
GIVE INFORMATION
ON para_foll_font_ind:

para_foll_depth_ind
0.17
para_prec_font_ind
0.13
para_foll_size_ind
0.08
form_font_family_mode_ind
0.06
para_foll_colour_ind
0.04
para_prec_depth_ind
0.03
style_heading
0.02
para_prec_size_ind
0.02
style_table
0.01
target
0.01
para_prec_colour_ind
0.00
style_toc
0.00
lang_ls_clscl
0.00
style_bullet
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_foll_font_ind
CORRELATION RATIO WITH...

form_rel_font_size
0.04
lang_pct_determiner
0.04
lang_pct_cardinal_digit
0.03
lang_pct_modal
0.03
lang_pct_verb_base_form
0.03
lang_pct_noun_singular
0.03
lang_pct_wh_pronoun
0.02
lang_mean_words_per_sent
0.02
target_encoded
0.02
lang_pct_wh_abverb
0.01
lang_pct_adverb
0.01
lang_pct_to_infinitive_preposition
0.01
lang_pct_interjection
0.01
lang_pct_proper_noun_singular
0.01
para_prec_colour_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
1
92,352
92%
0.428
0
7,648
8%
0.469
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_prec_colour_ind
PROVIDES INFORMATION ON...

para_foll_colour_ind
0.19
form_font_colour_mode_ind
0.16
para_prec_size_ind
0.05
para_prec_font_ind
0.03
para_foll_size_ind
0.02
para_prec_depth_ind
0.02
style_title
0.01
para_prec_italic_ind
0.01
is_bold
0.01
style_heading
0.01
style_toc
0.01
is_italic
0.01
style_head_foot
0.00
is_underline
0.00

THESE FEATURES
GIVE INFORMATION
ON para_prec_colour_ind:

para_foll_colour_ind
0.19
form_font_colour_mode_ind
0.17
para_prec_size_ind
0.08
para_prec_font_ind
0.05
para_prec_depth_ind
0.04
para_foll_size_ind
0.03
target
0.01
is_bold
0.01
para_foll_depth_ind
0.01
para_foll_font_ind
0.01
para_prec_bold_ind
0.01
para_foll_bold_ind
0.00
style_heading
0.00
style_list_num
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_prec_colour_ind
CORRELATION RATIO WITH...

form_rel_font_size
0.08
lang_pct_proper_noun_singular
0.05
lang_mean_words_per_sent
0.04
lang_pct_preposition_subordinating_conjunction
0.03
lang_pct_determiner
0.03
lang_pct_to_infinitive_preposition
0.02
lang_pct_cardinal_digit
0.02
form_rel_depth
0.02
lang_pct_verb_base_form
0.02
lang_pct_verb_3rd_person_sing_present
0.02
lang_pct_wh_abverb
0.02
lang_pct_adjective
0.02
target_encoded
0.02
customer_pk
0.01
para_foll_colour_ind
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
1
92,454
92%
0.429
0
7,546
8%
0.461
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
para_foll_colour_ind
PROVIDES INFORMATION ON...

para_prec_colour_ind
0.19
form_font_colour_mode_ind
0.16
para_foll_size_ind
0.04
para_foll_font_ind
0.04
para_foll_depth_ind
0.02
para_prec_size_ind
0.01
is_italic
0.01
style_toc
0.01
para_prec_font_ind
0.01
para_prec_depth_ind
0.00
style_heading
0.00
para_foll_italic_ind
0.00
style_box
0.00
para_foll_bold_ind
0.00

THESE FEATURES
GIVE INFORMATION
ON para_foll_colour_ind:

para_prec_colour_ind
0.19
form_font_colour_mode_ind
0.18
para_foll_size_ind
0.07
para_foll_font_ind
0.05
para_foll_depth_ind
0.04
para_prec_size_ind
0.02
para_prec_depth_ind
0.01
para_prec_font_ind
0.01
para_foll_bold_ind
0.01
is_bold
0.01
target
0.01
style_heading
0.00
is_italic
0.00
begins_with
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
para_foll_colour_ind
CORRELATION RATIO WITH...

form_rel_font_size
0.05
lang_pct_proper_noun_singular
0.02
lang_pct_cardinal_digit
0.02
lang_pct_preposition_subordinating_conjunction
0.02
lang_mean_words_per_sent
0.02
lang_pct_wh_abverb
0.02
customer_pk
0.01
target_encoded
0.01
lang_pct_interjection
0.01
lang_pct_particle
0.01
form_rel_depth
0.01
lang_pct_adverb_comparative
0.01
lang_pct_wh_determiner
0.01
lang_pct_foreign_word
0.01
is_upper
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
93,546
94%
0.446
1
6,454
6%
0.221
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
is_upper
PROVIDES INFORMATION ON...

lang_ls_alnum
0.04
lang_ls_fs
0.03
target
0.02
lang_ls_qm
0.02
style_list_num
0.01
style_bullet
0.01
style_q
0.01
style_cover_nm_add
0.01
style_table
0.01
style_box
0.01
is_bold
0.01
is_underline
0.00
style_ans
0.00
style_head_foot
0.00

THESE FEATURES
GIVE INFORMATION
ON is_upper:

lang_ls_alnum
0.11
target
0.08
lang_ls_fs
0.06
lang_ls_qm
0.03
style_list_num
0.02
is_bold
0.01
style_table
0.01
style_q
0.00
para_prec_size_ind
0.00
style_bullet
0.00
begins_with
0.00
lang_ls_brkt
0.00
style_heading
0.00
para_foll_size_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
is_upper
CORRELATION RATIO WITH...

lang_pct_noun_singular
0.28
lang_mean_words_per_sent
0.19
lang_pct_preposition_subordinating_conjunction
0.18
lang_pct_verb_base_form
0.12
lang_pct_noun_plural
0.11
lang_pct_to_infinitive_preposition
0.11
lang_pct_coordinating_conjunction
0.10
lang_pct_punct
0.10
lang_pct_verb_past_participle
0.09
lang_pct_verb_3rd_person_sing_present
0.09
lang_pct_determiner
0.09
target_encoded
0.09
lang_pct_sym
0.08
lang_pct_possessive_pronoun
0.08
is_title
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
75,070
75%
0.479
1
24,930
25%
0.289
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
is_title
PROVIDES INFORMATION ON...

lang_ls_alnum
0.14
lang_ls_fs
0.12
lang_ls_qm
0.09
target
0.08
style_toc
0.03
style_list_num
0.02
is_bold
0.02
style_q
0.02
style_bullet
0.01
style_table
0.01
style_heading
0.01
lang_ls_brkt
0.01
form_font_colour_mode_ind
0.01
is_underline
0.01

THESE FEATURES
GIVE INFORMATION
ON is_title:

lang_ls_alnum
0.17
lang_ls_fs
0.12
target
0.11
lang_ls_qm
0.06
style_list_num
0.02
is_bold
0.02
style_table
0.00
style_toc
0.00
begins_with
0.00
style_heading
0.00
form_font_colour_mode_ind
0.00
style_q
0.00
lang_ls_brkt
0.00
style_bullet
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
is_title
CORRELATION RATIO WITH...

lang_mean_words_per_sent
0.42
lang_pct_preposition_subordinating_conjunction
0.40
lang_pct_verb_base_form
0.25
lang_pct_coordinating_conjunction
0.24
lang_pct_to_infinitive_preposition
0.24
lang_pct_noun_singular
0.20
lang_pct_verb_3rd_person_sing_present
0.18
lang_pct_proper_noun_singular
0.17
lang_pct_interjection
0.17
lang_pct_verb_sing_present_non_third_person
0.15
lang_pct_possessive_pronoun
0.15
lang_pct_modal
0.15
target_encoded
0.13
lang_pct_wh_abverb
0.12
style
MISSING:
---
50,364
50%
-
-
default
14,284
14%
-
-
listparagraph
5,120
5%
-
-
tableparagraph
3,395
3%
-
-
bodytext
2,971
3%
-
-
normal
2,000
2%
-
-
heading2
1,215
1%
-
-
heading1
1,142
1%
-
-
heading3
903
<1%
-
-
nospacing
763
<1%
-
-
normalweb
689
<1%
-
-
toc2
581
<1%
-
-
toc1
558
<1%
-
-
tabletext
422
<1%
-
-
question
400
<1%
-
-
listnumber
308
<1%
-
-
heading4
297
<1%
-
-
normal1
266
<1%
-
-
paragraphedeliste
246
<1%
-
-
dvquestion
244
<1%
-
-
header
199
<1%
-
-
listbullet
198
<1%
-
-
toc3
197
<1%
-
-
body
181
<1%
-
-
paragraph
158
<1%
-
-
rfptext
158
<1%
-
-
bodytext2
138
<1%
-
-
textbody
138
<1%
-
-
tempnormal2
138
<1%
-
-
listbullet2
133
<1%
-
-
bodycopy
120
<1%
-
-
text
115
<1%
-
-
title
113
<1%
-
-
himbodytext
111
<1%
-
-
fnewbodytext
108
<1%
-
-
70tabletext
99
<1%
-
-
aimaddqquestion
98
<1%
-
-
heading5
97
<1%
-
-
bodytext1
97
<1%
-
-
rfpquestionsbox
93
<1%
-
-
esisquestion
92
<1%
-
-
bullet1
91
<1%
-
-
bodytextindent
87
<1%
-
-
inspring
80
<1%
-
-
heading6
79
<1%
-
-
tablequestion
77
<1%
-
-
plaintext
77
<1%
-
-
rfptabletext
74
<1%
-
-
rfpquestion
73
<1%
-
-
tableheading
71
<1%
-
-
tabel
71
<1%
-
-
tablebody
71
<1%
-
-
answer
69
<1%
-
-
bullet
65
<1%
-
-
prrafodelista
63
<1%
-
-
q1-questiontext
62
<1%
-
-
dvanswer
62
<1%
-
-
style1
61
<1%
-
-
tabelleaufzhlung1
61
<1%
-
-
rfpquestionbox
58
<1%
-
-
subtitle
9,699
10%
-
-
(Other)
style_bullet
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
98,333
98%
0.434
1
1,667
2%
0.274
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_bullet
PROVIDES INFORMATION ON...

form_font_colour_mode_ind
0.01
style_ans
0.01
style_heading
0.00
is_upper
0.00
style_toc
0.00
style_list_num
0.00
style_head_foot
0.00
style_title
0.00
style_q
0.00
begins_with
0.00
para_prec_font_ind
0.00
style_cover_nm_add
0.00
para_foll_font_ind
0.00
is_title
0.00

THESE FEATURES
GIVE INFORMATION
ON style_bullet:

form_font_colour_mode_ind
0.03
style_list_num
0.01
begins_with
0.01
style_heading
0.01
is_upper
0.01
is_title
0.01
form_font_family_mode_ind
0.01
para_prec_font_ind
0.01
para_foll_font_ind
0.01
target
0.01
para_foll_depth_ind
0.01
para_prec_depth_ind
0.00
is_bold
0.00
para_prec_bold_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_bullet
CORRELATION RATIO WITH...

target_encoded
0.03
lang_pct_coordinating_conjunction
0.03
lang_pct_preposition_subordinating_conjunction
0.02
lang_pct_cardinal_digit
0.02
lang_pct_verb_gerund_present_participle
0.02
lang_pct_noun_plural
0.02
lang_mean_words_per_sent
0.01
lang_pct_foreign_word
0.01
lang_pct_to_infinitive_preposition
0.01
lang_pct_interjection
0.01
lang_pct_adjective
0.01
lang_pct_adverb_comparative
0.01
lang_pct_punct
0.01
form_rel_depth
0.01
style_table
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
92,556
93%
0.445
1
7,444
7%
0.261
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_table
PROVIDES INFORMATION ON...

style_list_num
0.03
style_toc
0.02
begins_with
0.01
para_prec_font_ind
0.01
form_font_family_mode_ind
0.01
para_foll_font_ind
0.01
style_cover_nm_add
0.01
is_upper
0.01
lang_ls_alnum
0.01
style_ans
0.01
para_prec_depth_ind
0.00
is_title
0.00
para_foll_depth_ind
0.00
style_indent
0.00

THESE FEATURES
GIVE INFORMATION
ON style_table:

style_list_num
0.04
begins_with
0.02
form_font_family_mode_ind
0.02
lang_ls_alnum
0.02
target
0.01
para_prec_font_ind
0.01
para_foll_font_ind
0.01
is_title
0.01
para_prec_depth_ind
0.01
para_foll_depth_ind
0.01
lang_ls_fs
0.01
is_upper
0.01
lang_ls_qm
0.01
style_toc
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_table
CORRELATION RATIO WITH...

lang_mean_words_per_sent
0.09
target_encoded
0.07
lang_pct_noun_singular
0.06
form_rel_font_size
0.06
lang_pct_preposition_subordinating_conjunction
0.05
lang_pct_possessive_pronoun
0.05
lang_pct_verb_3rd_person_sing_present
0.04
lang_pct_determiner
0.04
lang_pct_verb_base_form
0.04
lang_pct_wh_pronoun
0.03
lang_pct_verb_sing_present_non_third_person
0.03
lang_pct_coordinating_conjunction
0.03
lang_pct_punct
0.03
lang_pct_wh_abverb
0.03
style_list_num
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
83,465
83%
0.405
1
16,535
17%
0.566
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_list_num
PROVIDES INFORMATION ON...

begins_with
0.06
style_table
0.04
style_heading
0.04
style_toc
0.04
style_head_foot
0.03
style_box
0.03
style_ans
0.02
is_upper
0.02
style_title
0.02
is_title
0.02
style_indent
0.02
target
0.02
style_q
0.02
para_foll_depth_ind
0.01

THESE FEATURES
GIVE INFORMATION
ON style_list_num:

begins_with
0.09
target
0.03
style_table
0.03
is_title
0.02
para_foll_depth_ind
0.02
style_heading
0.02
lang_ls_alnum
0.02
para_prec_depth_ind
0.02
is_upper
0.01
lang_ls_qm
0.01
style_toc
0.01
lang_ls_fs
0.01
style_q
0.00
para_prec_size_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_list_num
CORRELATION RATIO WITH...

lang_mean_words_per_sent
0.10
target_encoded
0.09
lang_pct_preposition_subordinating_conjunction
0.09
lang_pct_wh_pronoun
0.07
lang_pct_cardinal_digit
0.06
lang_pct_proper_noun_singular
0.06
lang_pct_verb_base_form
0.06
lang_pct_coordinating_conjunction
0.06
lang_pct_wh_abverb
0.06
lang_pct_possessive_pronoun
0.06
lang_pct_verb_3rd_person_sing_present
0.05
lang_pct_noun_singular
0.05
lang_pct_verb_sing_present_non_third_person
0.05
lang_pct_to_infinitive_preposition
0.05
style_heading
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
94,136
94%
0.405
1
5,864
6%
0.861
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_heading
PROVIDES INFORMATION ON...

target
0.02
para_foll_font_ind
0.02
style_list_num
0.02
para_prec_font_ind
0.02
para_prec_size_ind
0.02
para_prec_depth_ind
0.02
para_foll_size_ind
0.02
para_foll_depth_ind
0.01
style_q
0.01
style_bullet
0.01
lang_ls_alnum
0.01
style_head_foot
0.01
begins_with
0.01
style_title
0.01

THESE FEATURES
GIVE INFORMATION
ON style_heading:

target
0.08
para_prec_depth_ind
0.04
style_list_num
0.04
para_foll_font_ind
0.03
para_prec_size_ind
0.03
lang_ls_alnum
0.03
para_foll_depth_ind
0.03
para_prec_font_ind
0.03
para_foll_size_ind
0.03
begins_with
0.03
lang_ls_fs
0.01
is_title
0.01
para_prec_colour_ind
0.01
style_q
0.01

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_heading
CORRELATION RATIO WITH...

target_encoded
0.17
lang_pct_proper_noun_singular
0.10
lang_mean_words_per_sent
0.08
lang_pct_punct
0.07
lang_pct_determiner
0.06
form_rel_font_size
0.06
lang_pct_noun_plural
0.06
lang_pct_verb_base_form
0.05
lang_pct_verb_3rd_person_sing_present
0.05
lang_pct_preposition_subordinating_conjunction
0.04
lang_pct_to_infinitive_preposition
0.04
lang_pct_adverb
0.04
lang_pct_verb_sing_present_non_third_person
0.04
lang_pct_modal
0.04
style_box
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
99,774
>99%
0.431
1
226
<1%
0.668
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_box
PROVIDES INFORMATION ON...

style_q
0.05
style_indent
0.00
form_font_family_mode_ind
0.00
style_list_num
0.00
is_bold
0.00
target
0.00
para_prec_bold_ind
0.00
is_italic
0.00
para_prec_italic_ind
0.00
para_foll_italic_ind
0.00
para_foll_bold_ind
0.00
style_toc
0.00
form_font_colour_mode_ind
0.00
is_upper
0.00

THESE FEATURES
GIVE INFORMATION
ON style_box:

style_q
0.30
form_font_family_mode_ind
0.04
target
0.04
is_bold
0.03
style_list_num
0.03
para_prec_bold_ind
0.02
para_foll_bold_ind
0.01
lang_ls_fs
0.01
form_font_colour_mode_ind
0.01
is_upper
0.01
para_foll_size_ind
0.01
para_foll_font_ind
0.00
lang_ls_alnum
0.00
para_foll_colour_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_box
CORRELATION RATIO WITH...

target_encoded
0.02
customer_pk
0.02
lang_pct_possessive_pronoun
0.02
lang_pct_proper_noun_singular
0.01
lang_pct_foreign_word
0.01
form_rel_depth
0.01
lang_pct_to_infinitive_preposition
0.01
lang_pct_verb_gerund_present_participle
0.01
lang_pct_wh_determiner
0.01
lang_pct_modal
0.01
lang_pct_interjection
0.00
lang_pct_adjective
0.00
lang_pct_determiner
0.00
lang_pct_verb_past_participle
0.00
style_toc
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
98,420
98%
0.438
1
1,580
2%
0.021
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_toc
PROVIDES INFORMATION ON...

lang_ls_alnum
0.02
target
0.01
lang_ls_fs
0.01
style_list_num
0.01
lang_ls_qm
0.01
style_table
0.00
is_title
0.00
is_bold
0.00
lang_ls_clscl
0.00
lang_ls_brkt
0.00
begins_with
0.00
para_prec_bold_ind
0.00
style_q
0.00
para_foll_font_ind
0.00

THESE FEATURES
GIVE INFORMATION
ON style_toc:

lang_ls_alnum
0.13
target
0.08
lang_ls_fs
0.05
style_list_num
0.04
is_title
0.03
begins_with
0.03
is_bold
0.03
lang_ls_qm
0.02
para_prec_bold_ind
0.02
para_foll_bold_ind
0.02
para_foll_font_ind
0.02
style_table
0.02
para_foll_depth_ind
0.01
para_prec_depth_ind
0.01

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_toc
CORRELATION RATIO WITH...

lang_pct_cardinal_digit
0.34
target_encoded
0.08
lang_pct_proper_noun_singular
0.06
lang_mean_words_per_sent
0.06
lang_pct_punct
0.06
lang_pct_noun_singular
0.05
lang_pct_determiner
0.05
lang_pct_verb_base_form
0.05
lang_pct_preposition_subordinating_conjunction
0.05
lang_pct_modal
0.03
lang_pct_verb_sing_present_non_third_person
0.03
lang_pct_to_infinitive_preposition
0.03
lang_pct_verb_past_participle
0.03
lang_pct_adjective
0.03
style_q
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
97,834
98%
0.422
1
2,166
2%
0.845
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_q
PROVIDES INFORMATION ON...

style_box
0.30
target
0.01
style_heading
0.01
lang_ls_qm
0.00
lang_ls_alnum
0.00
is_upper
0.00
style_toc
0.00
style_cover_nm_add
0.00
style_list_num
0.00
form_font_family_mode_ind
0.00
style_head_foot
0.00
is_title
0.00
style_ans
0.00
is_italic
0.00

THESE FEATURES
GIVE INFORMATION
ON style_q:

target
0.12
style_box
0.05
lang_ls_alnum
0.03
form_font_family_mode_ind
0.02
is_title
0.02
lang_ls_qm
0.02
style_list_num
0.02
lang_ls_fs
0.01
begins_with
0.01
style_heading
0.01
is_upper
0.01
para_prec_font_ind
0.00
form_font_colour_mode_ind
0.00
style_toc
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_q
CORRELATION RATIO WITH...

target_encoded
0.09
lang_pct_possessive_pronoun
0.07
lang_pct_proper_noun_singular
0.04
lang_pct_wh_abverb
0.04
lang_mean_words_per_sent
0.04
lang_pct_verb_base_form
0.04
form_rel_font_size
0.04
lang_pct_wh_pronoun
0.03
lang_pct_preposition_subordinating_conjunction
0.03
lang_pct_cardinal_digit
0.03
lang_pct_verb_sing_present_non_third_person
0.03
lang_pct_verb_3rd_person_sing_present
0.02
customer_pk
0.02
lang_pct_to_infinitive_preposition
0.02
style_ans
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
99,706
>99%
0.432
1
294
<1%
0.224
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_ans
PROVIDES INFORMATION ON...

style_indent
0.01
style_bullet
0.00
style_list_num
0.00
style_heading
0.00
style_q
0.00
style_toc
0.00
style_head_foot
0.00
style_title
0.00
style_table
0.00
style_box
0.00
para_prec_italic_ind
0.00
style_cover_nm_add
0.00
is_bold
0.00
is_upper
0.00

THESE FEATURES
GIVE INFORMATION
ON style_ans:

style_list_num
0.02
style_indent
0.02
target
0.01
style_heading
0.01
is_bold
0.01
lang_ls_fs
0.01
style_bullet
0.01
style_table
0.01
form_font_family_mode_ind
0.01
para_foll_depth_ind
0.01
begins_with
0.00
lang_ls_alnum
0.00
style_q
0.00
is_upper
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_ans
CORRELATION RATIO WITH...

lang_pct_verb_past_tense
0.02
lang_pct_determiner
0.02
target_encoded
0.02
lang_num_sents
0.01
lang_num_words
0.01
lang_pct_possessive_pronoun
0.01
form_rel_depth
0.01
lang_pct_possessive_ending
0.01
lang_pct_to_infinitive_preposition
0.01
lang_mean_words_per_sent
0.01
lang_pct_verb_sing_present_non_third_person
0.01
customer_pk
0.01
lang_pct_proper_noun_singular
0.01
form_rel_font_size
0.01
style_title
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
99,641
>99%
0.432
1
359
<1%
0.401
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_title
PROVIDES INFORMATION ON...

style_cover_nm_add
0.02
is_underline
0.00
para_foll_size_ind
0.00
para_prec_colour_ind
0.00
style_list_num
0.00
style_heading
0.00
lang_ls_alnum
0.00
para_prec_font_ind
0.00
style_bullet
0.00
style_indent
0.00
style_head_foot
0.00
style_ans
0.00
style_box
0.00
lang_ls_fs
0.00

THESE FEATURES
GIVE INFORMATION
ON style_title:

para_foll_size_ind
0.02
lang_ls_alnum
0.02
style_list_num
0.02
para_prec_colour_ind
0.01
target
0.01
para_prec_font_ind
0.01
lang_ls_fs
0.01
style_heading
0.01
is_underline
0.01
para_prec_size_ind
0.01
lang_ls_qm
0.01
para_foll_font_ind
0.01
is_title
0.00
para_prec_bold_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_title
CORRELATION RATIO WITH...

lang_pct_proper_noun_singular
0.04
form_rel_font_size
0.03
lang_mean_words_per_sent
0.02
lang_pct_determiner
0.02
lang_pct_to_infinitive_preposition
0.02
lang_pct_verb_3rd_person_sing_present
0.02
lang_pct_punct
0.01
lang_pct_verb_base_form
0.01
lang_pct_modal
0.01
lang_pct_proper_noun_plural
0.01
lang_pct_verb_past_participle
0.01
lang_pct_personal_pronoun
0.01
lang_pct_wh_abverb
0.01
lang_pct_verb_sing_present_non_third_person
0.01
style_indent
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
99,507
>99%
0.432
1
493
<1%
0.379
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_indent
PROVIDES INFORMATION ON...

style_ans
0.02
style_box
0.00
style_list_num
0.00
style_heading
0.00
style_toc
0.00
style_head_foot
0.00
style_title
0.00
style_cover_nm_add
0.00
style_q
0.00
style_bullet
0.00
para_prec_font_ind
0.00
style_table
0.00
para_foll_font_ind
0.00
form_font_family_mode_ind
0.00

THESE FEATURES
GIVE INFORMATION
ON style_indent:

style_list_num
0.02
style_ans
0.01
style_heading
0.01
para_prec_font_ind
0.01
para_foll_font_ind
0.01
form_font_family_mode_ind
0.01
begins_with
0.00
style_table
0.00
lang_ls_alnum
0.00
lang_ls_fs
0.00
is_title
0.00
style_toc
0.00
target
0.00
is_upper
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_indent
CORRELATION RATIO WITH...

lang_pct_preposition_subordinating_conjunction
0.02
lang_mean_words_per_sent
0.02
customer_pk
0.02
form_rel_font_size
0.01
lang_pct_determiner
0.01
lang_pct_possessive_pronoun
0.01
lang_pct_proper_noun_singular
0.01
lang_pct_verb_sing_present_non_third_person
0.01
lang_pct_cardinal_digit
0.01
lang_pct_adverb_superlative
0.01
lang_pct_coordinating_conjunction
0.01
lang_pct_wh_determiner
0.01
lang_pct_verb_base_form
0.01
lang_pct_noun_singular
0.01
style_cover_nm_add
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
99,972
>99%
0.431
1
28
<1%
0.143
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_cover_nm_add
PROVIDES INFORMATION ON...

style_title
0.00
begins_with
0.00
form_font_family_mode_ind
0.00
style_q
0.00
para_prec_depth_ind
0.00
style_table
0.00
is_upper
0.00
style_heading
0.00
is_bold
0.00
para_prec_bold_ind
0.00
para_foll_bold_ind
0.00
is_italic
0.00
para_prec_italic_ind
0.00
para_foll_italic_ind
0.00

THESE FEATURES
GIVE INFORMATION
ON style_cover_nm_add:

begins_with
0.04
form_font_family_mode_ind
0.03
para_prec_depth_ind
0.02
style_title
0.02
target
0.02
is_bold
0.01
para_prec_bold_ind
0.01
para_foll_bold_ind
0.01
style_table
0.01
is_upper
0.01
style_heading
0.01
para_prec_size_ind
0.01
para_foll_depth_ind
0.00
lang_ls_alnum
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_cover_nm_add
CORRELATION RATIO WITH...

lang_pct_proper_noun_plural
0.01
lang_pct_proper_noun_singular
0.01
target_encoded
0.01
lang_pct_verb_gerund_present_participle
0.01
lang_pct_verb_base_form
0.01
lang_pct_noun_plural
0.01
lang_pct_verb_past_tense
0.01
lang_pct_adjective
0.01
lang_pct_preposition_subordinating_conjunction
0.01
lang_pct_determiner
0.01
lang_pct_adjective_superlative
0.00
lang_pct_verb_sing_present_non_third_person
0.00
lang_pct_personal_pronoun
0.00
lang_mean_words_per_sent
0.00
style_head_foot
MISSING:
---
TOP CATEGORIES

target_encoded
(avg)
0
99,510
>99%
0.431
1
490
<1%
0.584
ALL
100,000
100%
0.431
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
style_head_foot
PROVIDES INFORMATION ON...

style_list_num
0.00
style_heading
0.00
style_bullet
0.00
style_q
0.00
style_indent
0.00
style_title
0.00
style_ans
0.00
para_prec_colour_ind
0.00
style_cover_nm_add
0.00
is_upper
0.00
lang_ls_alnum
0.00
is_title
0.00
target
0.00
is_italic
0.00

THESE FEATURES
GIVE INFORMATION
ON style_head_foot:

style_list_num
0.03
style_heading
0.01
target
0.01
lang_ls_alnum
0.01
para_prec_colour_ind
0.00
is_title
0.00
style_q
0.00
is_upper
0.00
style_bullet
0.00
lang_ls_fs
0.00
para_prec_size_ind
0.00
para_prec_bold_ind
0.00
para_foll_colour_ind
0.00
para_foll_size_ind
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
style_head_foot
CORRELATION RATIO WITH...

lang_mean_words_per_sent
0.02
target_encoded
0.02
lang_pct_noun_singular
0.02
lang_pct_to_infinitive_preposition
0.01
lang_pct_modal
0.01
lang_pct_verb_sing_present_non_third_person
0.01
customer_pk
0.01
lang_pct_verb_3rd_person_sing_present
0.01
lang_pct_sym
0.01
lang_pct_personal_pronoun
0.01
lang_pct_verb_base_form
0.01
form_rel_font_size
0.01
lang_pct_determiner
0.01
lang_pct_possessive_pronoun
0.01
lang_pct_coordinating_conjunction
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_mean_words_per_sent
0.21
lang_pct_noun_singular
-0.13
target_encoded
0.09
lang_pct_cardinal_digit
-0.08
lang_pct_punct
-0.06
lang_pct_verb_base_form
0.05
lang_pct_interjection
-0.05
lang_pct_to_infinitive_preposition
0.04
lang_pct_noun_plural
0.03
lang_pct_foreign_word
-0.03
lang_num_words
0.03
lang_pct_wh_determiner
0.02
lang_pct_preposition_subordinating_conjunction
0.02
lang_pct_proper_noun_plural
0.02

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_title
0.24
lang_ls_fs
0.13
is_upper
0.10
target
0.10
lang_ls_alnum
0.07
style_list_num
0.06
begins_with
0.05
lang_ls_brkt
0.05
lang_ls_clscl
0.04
is_bold
0.04
style_table
0.03
style_heading
0.03
style_bullet
0.03
para_prec_depth_ind
0.03
MOST FREQUENT VALUES

0.0
67,567
67.6%
0.07692307692307693
1,047
1.0%
0.07142857142857142
1,034
1.0%
0.08333333333333333
1,027
1.0%
0.06666666666666667
1,009
1.0%
0.09090909090909091
1,006
1.0%
0.1
959
1.0%
0.25
952
1.0%
0.0625
942
0.9%
0.125
935
0.9%
0.1111111111111111
908
0.9%
0.058823529411764705
884
0.9%
0.05555555555555555
844
0.8%
0.2
843
0.8%
0.14285714285714285
826
0.8%
SMALLEST VALUES

0.0
67,567
67.6%
0.0015384615384615385
1
<0.1%
0.0033112582781456954
1
<0.1%
0.0035842293906810036
1
<0.1%
0.004830917874396135
1
<0.1%
0.005376344086021506
1
<0.1%
0.00558659217877095
2
<0.1%
0.0058823529411764705
1
<0.1%
0.005917159763313609
1
<0.1%
0.005952380952380952
1
<0.1%
0.006060606060606061
2
<0.1%
0.006993006993006993
1
<0.1%
0.007142857142857143
1
<0.1%
0.0072992700729927005
1
<0.1%
0.007352941176470588
1
<0.1%
LARGEST VALUES

1.0
11
<0.1%
0.5
17
<0.1%
0.4
12
<0.1%
0.3333333333333333
509
0.5%
0.3
2
<0.1%
0.2857142857142857
14
<0.1%
0.2727272727272727
3
<0.1%
0.26666666666666666
1
<0.1%
0.25
952
1.0%
0.23529411764705882
4
<0.1%
0.23076923076923078
6
<0.1%
0.2222222222222222
50
<0.1%
0.21739130434782608
2
<0.1%
0.21428571428571427
7
<0.1%
0.20833333333333334
1
<0.1%
lang_pct_cardinal_digit
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_noun_singular
-0.12
target_encoded
-0.11
lang_pct_preposition_subordinating_conjunction
-0.10
lang_pct_verb_base_form
-0.10
lang_pct_determiner
-0.10
lang_mean_words_per_sent
-0.09
lang_pct_coordinating_conjunction
-0.08
lang_pct_to_infinitive_preposition
-0.07
lang_pct_verb_past_participle
-0.06
lang_pct_possessive_pronoun
-0.06
lang_pct_adjective
-0.06
lang_pct_modal
-0.06
lang_pct_verb_sing_present_non_third_person
-0.06
lang_pct_personal_pronoun
-0.05

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

style_toc
0.34
lang_ls_alnum
0.16
target
0.14
lang_ls_fs
0.11
is_title
0.10
lang_ls_qm
0.09
style_list_num
0.06
is_upper
0.06
lang_ls_clscl
0.05
begins_with
0.05
para_prec_depth_ind
0.05
form_font_family_mode_ind
0.04
para_foll_depth_ind
0.04
para_prec_font_ind
0.03
MOST FREQUENT VALUES

0.0
83,982
84.0%
0.5
1,316
1.3%
0.3333333333333333
1,260
1.3%
0.25
1,070
1.1%
0.2
733
0.7%
0.16666666666666666
694
0.7%
0.125
656
0.7%
0.14285714285714285
489
0.5%
0.1
336
0.3%
0.1111111111111111
325
0.3%
0.06666666666666667
306
0.3%
0.08333333333333333
270
0.3%
0.09090909090909091
264
0.3%
0.07142857142857142
245
0.2%
0.0625
238
0.2%
SMALLEST VALUES

0.0
83,982
84.0%
0.0009606147934678194
1
<0.1%
0.0011682242990654205
1
<0.1%
0.0012396694214876034
1
<0.1%
0.0013003901170351106
1
<0.1%
0.0014388489208633094
1
<0.1%
0.0016
1
<0.1%
0.001694915254237288
1
<0.1%
0.0018726591760299626
1
<0.1%
0.0019193857965451055
1
<0.1%
0.001941747572815534
1
<0.1%
0.0020491803278688526
1
<0.1%
0.0022123893805309734
1
<0.1%
0.0022560631697687537
1
<0.1%
0.002325581395348837
1
<0.1%
LARGEST VALUES

1.0
218
0.2%
0.8125
3
<0.1%
0.75
4
<0.1%
0.6666666666666666
112
0.1%
0.6363636363636364
2
<0.1%
0.625
2
<0.1%
0.6153846153846154
1
<0.1%
0.6
11
<0.1%
0.5714285714285714
4
<0.1%
0.5555555555555556
1
<0.1%
0.5333333333333333
1
<0.1%
0.5
1,316
1.3%
0.47619047619047616
1
<0.1%
0.4666666666666667
1
<0.1%
0.45454545454545453
1
<0.1%
lang_pct_determiner
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_proper_noun_singular
-0.19
lang_pct_noun_singular
-0.18
lang_mean_words_per_sent
0.17
lang_pct_preposition_subordinating_conjunction
0.13
lang_pct_verb_3rd_person_sing_present
0.12
lang_pct_cardinal_digit
-0.10
lang_pct_modal
0.09
lang_pct_noun_plural
-0.09
lang_pct_verb_base_form
0.07
lang_pct_to_infinitive_preposition
0.07
lang_pct_adjective
-0.06
lang_pct_wh_determiner
0.06
lang_pct_punct
-0.05
lang_pct_existential_there
0.04

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_fs
0.15
lang_ls_alnum
0.15
target
0.11
is_upper
0.09
is_title
0.07
lang_ls_qm
0.07
is_bold
0.07
style_heading
0.06
style_toc
0.05
form_font_colour_mode_ind
0.05
style_table
0.04
para_foll_depth_ind
0.04
lang_ls_brkt
0.04
para_foll_font_ind
0.04
MOST FREQUENT VALUES

0.0
58,122
58.1%
0.125
1,880
1.9%
0.14285714285714285
1,854
1.9%
0.1111111111111111
1,697
1.7%
0.1
1,631
1.6%
0.09090909090909091
1,461
1.5%
0.16666666666666666
1,437
1.4%
0.08333333333333333
1,368
1.4%
0.07692307692307693
1,223
1.2%
0.2
1,174
1.2%
0.07142857142857142
1,103
1.1%
0.06666666666666667
949
0.9%
0.0625
817
0.8%
1.0
749
0.7%
0.25
738
0.7%
SMALLEST VALUES

0.0
58,122
58.1%
0.006097560975609756
1
<0.1%
0.006369426751592357
4
<0.1%
0.006993006993006993
2
<0.1%
0.007751937984496124
1
<0.1%
0.008
1
<0.1%
0.008264462809917356
1
<0.1%
0.008849557522123894
1
<0.1%
0.009433962264150943
8
<0.1%
0.00966183574879227
1
<0.1%
0.009900990099009901
1
<0.1%
0.010416666666666666
1
<0.1%
0.010752688172043012
1
<0.1%
0.011235955056179775
1
<0.1%
0.011494252873563218
2
<0.1%
LARGEST VALUES

1.0
749
0.7%
0.5
487
0.5%
0.4
10
<0.1%
0.375
3
<0.1%
0.3333333333333333
389
0.4%
0.3076923076923077
10
<0.1%
0.3
11
<0.1%
0.29411764705882354
3
<0.1%
0.2916666666666667
1
<0.1%
0.2857142857142857
72
<0.1%
0.28
1
<0.1%
0.2777777777777778
4
<0.1%
0.2727272727272727
40
<0.1%
0.26666666666666666
33
<0.1%
0.2631578947368421
4
<0.1%
lang_pct_existential_there
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_verb_3rd_person_sing_present
0.11
target_encoded
0.05
lang_pct_determiner
0.04
lang_pct_proper_noun_singular
-0.04
lang_pct_preposition_subordinating_conjunction
0.03
lang_mean_words_per_sent
0.03
lang_pct_verb_sing_present_non_third_person
0.03
lang_pct_noun_singular
-0.02
form_rel_font_size
-0.02
lang_pct_cardinal_digit
-0.02
lang_pct_wh_determiner
0.02
lang_pct_to_infinitive_preposition
0.02
lang_pct_verb_past_participle
0.01
lang_pct_interjection
-0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_qm
0.14
target
0.09
lang_ls_alnum
0.08
is_title
0.06
is_upper
0.03
begins_with
0.02
style_list_num
0.02
form_font_family_mode_ind
0.02
is_bold
0.02
form_font_colour_mode_ind
0.02
lang_ls_brkt
0.02
lang_ls_clscl
0.02
style_heading
0.02
para_foll_size_ind
0.01
MOST FREQUENT VALUES

0.0
98,432
98.4%
0.05555555555555555
53
<0.1%
0.06666666666666667
50
<0.1%
0.0625
48
<0.1%
0.07142857142857142
45
<0.1%
0.058823529411764705
44
<0.1%
0.07692307692307693
43
<0.1%
0.08333333333333333
41
<0.1%
0.09090909090909091
39
<0.1%
0.1
39
<0.1%
0.037037037037037035
38
<0.1%
0.04
35
<0.1%
0.05
34
<0.1%
0.05263157894736842
34
<0.1%
0.043478260869565216
31
<0.1%
SMALLEST VALUES

0.0
98,432
98.4%
0.00011368804001819008
1
<0.1%
0.00014569825890580607
1
<0.1%
0.00020968756552736424
1
<0.1%
0.00023110700254217703
1
<0.1%
0.0002314814814814815
1
<0.1%
0.0002646202699126753
1
<0.1%
0.00026518164942985947
1
<0.1%
0.00026867275658248256
1
<0.1%
0.00027214587018641994
1
<0.1%
0.0002755580049600441
1
<0.1%
0.0002828854314002829
1
<0.1%
0.00028328611898016995
1
<0.1%
0.00028768699654775604
1
<0.1%
0.0002891844997108155
1
<0.1%
LARGEST VALUES

0.2
4
<0.1%
0.16666666666666666
12
<0.1%
0.14285714285714285
26
<0.1%
0.125
26
<0.1%
0.1111111111111111
29
<0.1%
0.10526315789473684
1
<0.1%
0.1
39
<0.1%
0.09090909090909091
39
<0.1%
0.08695652173913043
1
<0.1%
0.08333333333333333
41
<0.1%
0.08
1
<0.1%
0.07692307692307693
43
<0.1%
0.07407407407407407
1
<0.1%
0.07142857142857142
45
<0.1%
0.06666666666666667
50
<0.1%
lang_pct_foreign_word
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_mean_words_per_sent
0.04
lang_pct_punct
0.04
lang_pct_determiner
-0.03
lang_pct_coordinating_conjunction
-0.03
lang_pct_verb_base_form
-0.02
lang_pct_preposition_subordinating_conjunction
-0.02
lang_pct_to_infinitive_preposition
-0.02
lang_pct_verb_sing_present_non_third_person
0.02
lang_pct_verb_past_participle
-0.02
lang_pct_cardinal_digit
-0.02
lang_pct_proper_noun_singular
-0.02
lang_pct_modal
-0.02
lang_pct_possessive_pronoun
-0.02
lang_pct_adverb
-0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_title
0.05
lang_ls_alnum
0.04
lang_ls_fs
0.03
lang_ls_brkt
0.03
style_list_num
0.02
is_upper
0.02
target
0.02
style_heading
0.02
form_font_colour_mode_ind
0.01
style_bullet
0.01
style_toc
0.01
is_bold
0.01
para_foll_colour_ind
0.01
style_box
0.01
MOST FREQUENT VALUES

0.0
97,627
97.6%
0.038461538461538464
68
<0.1%
0.047619047619047616
64
<0.1%
0.05555555555555555
59
<0.1%
0.05
56
<0.1%
0.045454545454545456
48
<0.1%
0.043478260869565216
47
<0.1%
0.034482758620689655
47
<0.1%
0.05263157894736842
45
<0.1%
0.07692307692307693
44
<0.1%
0.058823529411764705
43
<0.1%
0.07142857142857142
43
<0.1%
0.03225806451612903
41
<0.1%
0.03333333333333333
39
<0.1%
0.0625
39
<0.1%
SMALLEST VALUES

0.0
97,627
97.6%
0.0001644195988161789
1
<0.1%
0.00016645859342488556
1
<0.1%
0.00016655562958027982
1
<0.1%
0.0001741098633237573
2
<0.1%
0.00017491691446562882
1
<0.1%
0.00021826912583215103
1
<0.1%
0.00022232103156958648
1
<0.1%
0.0002244668911335578
1
<0.1%
0.00023110700254217703
1
<0.1%
0.0002314814814814815
1
<0.1%
0.00023806689679800025
1
<0.1%
0.00023820867079561695
1
<0.1%
0.00023929169657812874
1
<0.1%
0.00024316109422492402
1
<0.1%
LARGEST VALUES

0.75
1
<0.1%
0.7333333333333333
1
<0.1%
0.6666666666666666
1
<0.1%
0.65
1
<0.1%
0.625
1
<0.1%
0.6
2
<0.1%
0.5833333333333334
1
<0.1%
0.5714285714285714
3
<0.1%
0.5555555555555556
1
<0.1%
0.5263157894736842
1
<0.1%
0.5185185185185185
1
<0.1%
0.5
17
<0.1%
0.47619047619047616
1
<0.1%
0.47058823529411764
1
<0.1%
0.4583333333333333
1
<0.1%
lang_pct_preposition_subordinating_conjunction
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_mean_words_per_sent
0.34
lang_pct_proper_noun_singular
-0.19
lang_pct_noun_singular
-0.17
lang_pct_determiner
0.13
lang_pct_verb_past_participle
0.11
lang_pct_cardinal_digit
-0.10
lang_pct_verb_base_form
0.10
lang_pct_modal
0.09
lang_pct_possessive_pronoun
0.08
lang_pct_interjection
-0.08
lang_pct_verb_3rd_person_sing_present
0.07
lang_pct_wh_determiner
0.07
lang_pct_personal_pronoun
0.06
lang_pct_wh_pronoun
0.05

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_title
0.40
lang_ls_alnum
0.25
lang_ls_fs
0.25
target
0.20
is_upper
0.18
lang_ls_qm
0.09
style_list_num
0.09
is_bold
0.09
form_font_colour_mode_ind
0.06
lang_ls_brkt
0.05
style_toc
0.05
style_table
0.05
style_heading
0.04
begins_with
0.04
MOST FREQUENT VALUES

0.0
52,650
52.6%
0.125
2,297
2.3%
0.14285714285714285
2,235
2.2%
0.1111111111111111
2,169
2.2%
0.16666666666666666
2,143
2.1%
0.1
2,058
2.1%
0.09090909090909091
1,777
1.8%
0.2
1,666
1.7%
0.08333333333333333
1,549
1.5%
0.07692307692307693
1,388
1.4%
0.25
1,331
1.3%
0.07142857142857142
1,192
1.2%
0.3333333333333333
1,094
1.1%
0.06666666666666667
957
1.0%
0.0625
874
0.9%
SMALLEST VALUES

0.0
52,650
52.6%
0.011111111111111112
1
<0.1%
0.011627906976744186
1
<0.1%
0.012345679012345678
1
<0.1%
0.0125
1
<0.1%
0.012658227848101266
1
<0.1%
0.012987012987012988
2
<0.1%
0.013157894736842105
2
<0.1%
0.013333333333333334
1
<0.1%
0.013888888888888888
1
<0.1%
0.014285714285714285
2
<0.1%
0.014492753623188406
2
<0.1%
0.014925373134328358
1
<0.1%
0.015151515151515152
2
<0.1%
0.015384615384615385
2
<0.1%
LARGEST VALUES

1.0
41
<0.1%
0.6666666666666666
8
<0.1%
0.6
1
<0.1%
0.5714285714285714
1
<0.1%
0.5
167
0.2%
0.4444444444444444
2
<0.1%
0.42857142857142855
5
<0.1%
0.4
62
<0.1%
0.38461538461538464
1
<0.1%
0.375
19
<0.1%
0.3333333333333333
1,094
1.1%
0.3125
2
<0.1%
0.3076923076923077
15
<0.1%
0.30434782608695654
1
<0.1%
0.3
44
<0.1%
lang_pct_adjective
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_noun_singular
-0.16
lang_pct_proper_noun_singular
-0.16
lang_pct_determiner
-0.06
lang_pct_punct
-0.06
lang_pct_cardinal_digit
-0.06
lang_pct_interjection
-0.05
lang_pct_verb_past_participle
-0.04
lang_pct_verb_gerund_present_participle
-0.03
lang_pct_preposition_subordinating_conjunction
-0.03
lang_pct_verb_base_form
-0.03
lang_pct_adverb_superlative
0.02
lang_pct_coordinating_conjunction
-0.02
lang_pct_verb_past_tense
-0.02
lang_pct_personal_pronoun
-0.02

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_upper
0.08
is_bold
0.03
style_toc
0.03
target
0.03
style_list_num
0.02
style_heading
0.02
para_prec_size_ind
0.02
para_prec_depth_ind
0.02
para_prec_colour_ind
0.02
para_prec_font_ind
0.02
para_prec_bold_ind
0.01
lang_ls_brkt
0.01
para_foll_size_ind
0.01
style_bullet
0.01
MOST FREQUENT VALUES

0.0
55,272
55.3%
0.3333333333333333
1,728
1.7%
0.5
1,675
1.7%
0.14285714285714285
1,553
1.6%
0.16666666666666666
1,552
1.6%
0.1
1,536
1.5%
0.1111111111111111
1,536
1.5%
0.125
1,522
1.5%
0.2
1,454
1.5%
0.09090909090909091
1,447
1.4%
0.25
1,405
1.4%
0.08333333333333333
1,345
1.3%
0.07692307692307693
1,291
1.3%
0.07142857142857142
1,173
1.2%
0.06666666666666667
1,047
1.0%
SMALLEST VALUES

0.0
55,272
55.3%
0.006535947712418301
1
<0.1%
0.008130081300813009
1
<0.1%
0.00819672131147541
1
<0.1%
0.008333333333333333
1
<0.1%
0.00909090909090909
1
<0.1%
0.009230769230769232
1
<0.1%
0.009345794392523364
1
<0.1%
0.009708737864077669
2
<0.1%
0.009900990099009901
1
<0.1%
0.01
2
<0.1%
0.010101010101010102
1
<0.1%
0.01020408163265306
1
<0.1%
0.010526315789473684
1
<0.1%
0.010638297872340425
2
<0.1%
LARGEST VALUES

1.0
707
0.7%
0.75
5
<0.1%
0.6666666666666666
45
<0.1%
0.6
5
<0.1%
0.5
1,675
1.7%
0.4444444444444444
3
<0.1%
0.42857142857142855
9
<0.1%
0.4
130
0.1%
0.38461538461538464
1
<0.1%
0.375
24
<0.1%
0.36363636363636365
6
<0.1%
0.35714285714285715
1
<0.1%
0.3333333333333333
1,728
1.7%
0.3157894736842105
2
<0.1%
0.3125
5
<0.1%
lang_pct_adjective_comparative
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_preposition_subordinating_conjunction
0.03
lang_pct_cardinal_digit
0.03
lang_pct_proper_noun_singular
-0.03
lang_pct_noun_singular
-0.03
lang_mean_words_per_sent
0.02
lang_pct_coordinating_conjunction
0.02
lang_pct_list_marker
0.02
lang_pct_verb_base_form
0.02
lang_pct_adjective
-0.01
target_encoded
-0.01
lang_pct_interjection
-0.01
lang_pct_modal
0.01
lang_pct_punct
-0.01
form_rel_font_size
-0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_title
0.02
is_upper
0.02
lang_ls_fs
0.02
para_foll_size_ind
0.01
para_prec_size_ind
0.01
target
0.01
begins_with
0.01
style_heading
0.01
style_toc
0.01
lang_ls_qm
0.01
lang_ls_alnum
0.01
is_bold
0.01
style_list_num
0.01
para_prec_depth_ind
0.00
MOST FREQUENT VALUES

0.0
98,228
98.2%
0.058823529411764705
35
<0.1%
0.038461538461538464
32
<0.1%
0.0625
32
<0.1%
0.07142857142857142
31
<0.1%
0.05555555555555555
31
<0.1%
0.047619047619047616
30
<0.1%
0.06666666666666667
29
<0.1%
0.125
27
<0.1%
0.05263157894736842
26
<0.1%
0.08333333333333333
26
<0.1%
0.03125
24
<0.1%
0.25
24
<0.1%
0.04
23
<0.1%
0.07692307692307693
23
<0.1%
SMALLEST VALUES

0.0
98,228
98.2%
0.0001431229426077
1
<0.1%
0.00015508684863523573
1
<0.1%
0.00016155088852988692
1
<0.1%
0.00017917935853789643
1
<0.1%
0.00018086453246518358
1
<0.1%
0.00018231540565177758
1
<0.1%
0.00022742779167614282
1
<0.1%
0.0002547987090198743
1
<0.1%
0.0002646202699126753
1
<0.1%
0.0002671653753673524
1
<0.1%
0.0002763957987838585
1
<0.1%
0.00029708853238265005
1
<0.1%
0.00031938677738741617
1
<0.1%
0.0003204101249599487
1
<0.1%
LARGEST VALUES

0.5
17
<0.1%
0.3333333333333333
7
<0.1%
0.25
24
<0.1%
0.2
17
<0.1%
0.18181818181818182
2
<0.1%
0.16666666666666666
13
<0.1%
0.15
1
<0.1%
0.14285714285714285
18
<0.1%
0.125
27
<0.1%
0.1111111111111111
18
<0.1%
0.10526315789473684
1
<0.1%
0.1
14
<0.1%
0.09090909090909091
18
<0.1%
0.08695652173913043
1
<0.1%
0.08333333333333333
26
<0.1%
lang_pct_adjective_superlative
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_proper_noun_singular
-0.03
lang_pct_preposition_subordinating_conjunction
0.02
lang_mean_words_per_sent
0.02
lang_pct_noun_singular
-0.02
lang_pct_possessive_wh_pronoun
0.01
lang_pct_wh_determiner
0.01
lang_pct_interjection
-0.01
lang_pct_possessive_pronoun
0.01
lang_pct_foreign_word
0.01
lang_pct_adverb_superlative
0.01
lang_pct_noun_plural
-0.01
lang_pct_determiner
0.00
lang_num_words
0.00
lang_pct_punct
-0.00

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_alnum
0.02
is_upper
0.02
is_title
0.02
lang_ls_fs
0.01
target
0.01
is_bold
0.01
lang_ls_qm
0.01
is_underline
0.01
style_title
0.01
para_prec_size_ind
0.01
is_italic
0.01
para_foll_depth_ind
0.00
para_prec_bold_ind
0.00
style_cover_nm_add
0.00
MOST FREQUENT VALUES

0.0
98,222
98.2%
0.07142857142857142
53
<0.1%
0.06666666666666667
37
<0.1%
0.0625
34
<0.1%
0.07692307692307693
33
<0.1%
0.05555555555555555
32
<0.1%
0.045454545454545456
31
<0.1%
0.04
31
<0.1%
0.09090909090909091
30
<0.1%
0.058823529411764705
30
<0.1%
0.03333333333333333
27
<0.1%
0.05
25
<0.1%
0.041666666666666664
25
<0.1%
0.05263157894736842
24
<0.1%
0.037037037037037035
24
<0.1%
SMALLEST VALUES

0.0
98,222
98.2%
0.0001467351430667645
2
<0.1%
0.0002755580049600441
1
<0.1%
0.00032
1
<0.1%
0.00033973161202649905
1
<0.1%
0.0003397893306150187
1
<0.1%
0.00034094783498124785
1
<0.1%
0.00034965034965034965
1
<0.1%
0.00038109756097560977
1
<0.1%
0.00038491147036181676
1
<0.1%
0.00038684719535783365
1
<0.1%
0.00041118421052631577
1
<0.1%
0.0004317789291882556
1
<0.1%
0.00045558086560364467
2
<0.1%
0.00046221400508435407
1
<0.1%
LARGEST VALUES

1.0
5
<0.1%
0.5
7
<0.1%
0.3333333333333333
13
<0.1%
0.25
16
<0.1%
0.2
10
<0.1%
0.16666666666666666
13
<0.1%
0.14285714285714285
10
<0.1%
0.125
14
<0.1%
0.1111111111111111
14
<0.1%
0.1
21
<0.1%
0.09523809523809523
2
<0.1%
0.09090909090909091
30
<0.1%
0.08695652173913043
1
<0.1%
0.08333333333333333
21
<0.1%
0.08
2
<0.1%
lang_pct_list_marker
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_adjective_comparative
0.02
lang_pct_punct
0.02
customer_pk
-0.01
lang_mean_words_per_sent
-0.01
lang_pct_noun_plural
-0.00
lang_pct_coordinating_conjunction
-0.00
lang_pct_verb_base_form
-0.00
lang_pct_determiner
-0.00
lang_pct_preposition_subordinating_conjunction
-0.00
lang_pct_proper_noun_singular
-0.00
lang_pct_modal
-0.00
lang_pct_noun_singular
-0.00
lang_pct_adverb
-0.00
form_rel_depth
-0.00

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_upper
0.01
para_foll_size_ind
0.01
para_prec_size_ind
0.01
para_prec_font_ind
0.01
style_title
0.01
begins_with
0.01
para_prec_depth_ind
0.01
style_list_num
0.01
para_foll_font_ind
0.01
para_foll_depth_ind
0.01
lang_ls_fs
0.00
form_font_family_mode_ind
0.00
target
0.00
lang_ls_brkt
0.00
MOST FREQUENT VALUES

0.0
99,974
>99.9%
0.16666666666666666
3
<0.1%
0.3333333333333333
2
<0.1%
0.1111111111111111
2
<0.1%
0.2
2
<0.1%
0.05263157894736842
2
<0.1%
0.125
1
<0.1%
0.019230769230769232
1
<0.1%
0.01694915254237288
1
<0.1%
0.1
1
<0.1%
0.022727272727272728
1
<0.1%
0.025
1
<0.1%
0.01282051282051282
1
<0.1%
0.018867924528301886
1
<0.1%
0.041666666666666664
1
<0.1%
SMALLEST VALUES

0.0
99,974
>99.9%
0.01282051282051282
1
<0.1%
0.01694915254237288
1
<0.1%
0.018867924528301886
1
<0.1%
0.019230769230769232
1
<0.1%
0.0196078431372549
1
<0.1%
0.022727272727272728
1
<0.1%
0.025
1
<0.1%
0.02631578947368421
1
<0.1%
0.037037037037037035
1
<0.1%
0.041666666666666664
1
<0.1%
0.05263157894736842
2
<0.1%
0.07692307692307693
1
<0.1%
0.08333333333333333
1
<0.1%
0.1
1
<0.1%
LARGEST VALUES

0.3333333333333333
2
<0.1%
0.2
2
<0.1%
0.16666666666666666
3
<0.1%
0.14285714285714285
1
<0.1%
0.125
1
<0.1%
0.1111111111111111
2
<0.1%
0.1
1
<0.1%
0.08333333333333333
1
<0.1%
0.07692307692307693
1
<0.1%
0.05263157894736842
2
<0.1%
0.041666666666666664
1
<0.1%
0.037037037037037035
1
<0.1%
0.02631578947368421
1
<0.1%
0.025
1
<0.1%
0.022727272727272728
1
<0.1%
lang_pct_modal
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_verb_base_form
0.32
lang_mean_words_per_sent
0.18
lang_pct_proper_noun_singular
-0.11
lang_pct_noun_singular
-0.10
lang_pct_wh_determiner
0.10
lang_pct_preposition_subordinating_conjunction
0.09
lang_pct_verb_past_participle
0.09
lang_pct_personal_pronoun
0.09
lang_pct_determiner
0.09
lang_pct_to_infinitive_preposition
0.09
lang_pct_wh_abverb
0.06
lang_pct_cardinal_digit
-0.06
lang_pct_adverb
0.04
lang_num_words
0.03

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_fs
0.17
lang_ls_alnum
0.17
is_title
0.15
target
0.10
is_upper
0.07
is_bold
0.07
lang_ls_qm
0.06
para_foll_depth_ind
0.05
lang_ls_brkt
0.04
style_heading
0.04
form_font_colour_mode_ind
0.04
style_toc
0.03
style_list_num
0.03
para_prec_underline_ind
0.03
MOST FREQUENT VALUES

0.0
85,543
85.5%
0.058823529411764705
412
0.4%
0.05555555555555555
396
0.4%
0.07142857142857142
392
0.4%
0.06666666666666667
389
0.4%
0.0625
376
0.4%
0.05263157894736842
362
0.4%
0.07692307692307693
361
0.4%
0.047619047619047616
359
0.4%
0.08333333333333333
348
0.3%
0.05
342
0.3%
0.045454545454545456
342
0.3%
0.09090909090909091
313
0.3%
0.1
312
0.3%
0.043478260869565216
306
0.3%
SMALLEST VALUES

0.0
85,543
85.5%
0.00046728971962616824
1
<0.1%
0.0004757373929590866
1
<0.1%
0.0004837929366231253
1
<0.1%
0.0005659309564233164
1
<0.1%
0.001145475372279496
1
<0.1%
0.0012081463583016915
1
<0.1%
0.001295672454003628
1
<0.1%
0.0013126230584117262
1
<0.1%
0.0013259082471492973
1
<0.1%
0.0014384349827387803
1
<0.1%
0.001440922190201729
1
<0.1%
0.001445086705202312
1
<0.1%
0.0014563106796116505
1
<0.1%
0.0014705882352941176
1
<0.1%
LARGEST VALUES

1.0
15
<0.1%
0.5
10
<0.1%
0.3333333333333333
20
<0.1%
0.25
33
<0.1%
0.2222222222222222
1
<0.1%
0.2
34
<0.1%
0.18181818181818182
2
<0.1%
0.17647058823529413
1
<0.1%
0.16666666666666666
68
<0.1%
0.15384615384615385
9
<0.1%
0.15
1
<0.1%
0.14285714285714285
122
0.1%
0.13793103448275862
1
<0.1%
0.13513513513513514
1
<0.1%
0.13333333333333333
13
<0.1%
lang_pct_noun_singular
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_proper_noun_singular
-0.36
lang_pct_noun_plural
-0.22
lang_mean_words_per_sent
-0.22
lang_pct_punct
-0.19
lang_pct_determiner
-0.18
lang_pct_preposition_subordinating_conjunction
-0.17
lang_pct_adjective
-0.16
lang_pct_verb_base_form
-0.15
lang_pct_coordinating_conjunction
-0.13
lang_pct_verb_sing_present_non_third_person
-0.13
lang_pct_verb_past_participle
-0.12
lang_pct_cardinal_digit
-0.12
lang_pct_to_infinitive_preposition
-0.12
lang_pct_adverb
-0.11

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_upper
0.28
lang_ls_alnum
0.24
is_title
0.20
lang_ls_fs
0.15
target
0.11
lang_ls_qm
0.09
lang_ls_brkt
0.09
is_bold
0.07
style_table
0.06
style_toc
0.05
begins_with
0.05
style_list_num
0.05
para_foll_depth_ind
0.05
form_font_colour_mode_ind
0.04
MOST FREQUENT VALUES

0.0
24,943
24.9%
1.0
8,950
8.9%
0.5
6,170
6.2%
0.3333333333333333
4,801
4.8%
0.25
4,118
4.1%
0.2
3,306
3.3%
0.16666666666666666
2,758
2.8%
0.14285714285714285
2,266
2.3%
0.125
1,793
1.8%
0.1111111111111111
1,410
1.4%
0.2857142857142857
1,279
1.3%
0.6666666666666666
1,233
1.2%
0.2222222222222222
1,213
1.2%
0.1
1,154
1.2%
0.18181818181818182
1,052
1.1%
SMALLEST VALUES

0.0
24,943
24.9%
0.0015384615384615385
1
<0.1%
0.009615384615384616
1
<0.1%
0.012048192771084338
1
<0.1%
0.012658227848101266
1
<0.1%
0.012987012987012988
1
<0.1%
0.0136986301369863
1
<0.1%
0.013888888888888888
1
<0.1%
0.014084507042253521
1
<0.1%
0.014705882352941176
1
<0.1%
0.014925373134328358
1
<0.1%
0.015384615384615385
1
<0.1%
0.017241379310344827
1
<0.1%
0.017543859649122806
2
<0.1%
0.01818181818181818
1
<0.1%
LARGEST VALUES

1.0
8,950
8.9%
0.8571428571428571
2
<0.1%
0.8461538461538461
1
<0.1%
0.8333333333333334
13
<0.1%
0.8
47
<0.1%
0.7777777777777778
3
<0.1%
0.75
163
0.2%
0.7272727272727273
1
<0.1%
0.7142857142857143
18
<0.1%
0.7
2
<0.1%
0.6923076923076923
3
<0.1%
0.6666666666666666
1,233
1.2%
0.6428571428571429
1
<0.1%
0.6363636363636364
5
<0.1%
0.625
28
<0.1%
lang_pct_noun_plural
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_noun_singular
-0.22
lang_pct_proper_noun_singular
-0.20
lang_pct_verb_sing_present_non_third_person
0.09
lang_pct_determiner
-0.09
lang_pct_punct
-0.08
lang_pct_verb_3rd_person_sing_present
-0.06
target_encoded
0.06
lang_pct_interjection
-0.06
lang_pct_cardinal_digit
-0.05
lang_pct_adverb
-0.04
lang_pct_coordinating_conjunction
0.03
lang_pct_personal_pronoun
-0.03
lang_mean_words_per_sent
-0.02
lang_pct_proper_noun_plural
-0.02

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_upper
0.11
target
0.06
style_heading
0.06
lang_ls_brkt
0.05
lang_ls_alnum
0.05
begins_with
0.03
para_foll_depth_ind
0.03
para_foll_bold_ind
0.03
style_list_num
0.02
para_prec_size_ind
0.02
para_foll_size_ind
0.02
form_font_colour_mode_ind
0.02
is_title
0.02
style_q
0.02
MOST FREQUENT VALUES

0.0
52,103
52.1%
0.5
2,378
2.4%
0.125
1,848
1.8%
0.14285714285714285
1,833
1.8%
0.16666666666666666
1,785
1.8%
0.3333333333333333
1,718
1.7%
0.2
1,708
1.7%
0.25
1,620
1.6%
0.1111111111111111
1,600
1.6%
0.1
1,584
1.6%
0.09090909090909091
1,454
1.5%
0.08333333333333333
1,302
1.3%
0.07692307692307693
1,242
1.2%
1.0
1,178
1.2%
0.07142857142857142
1,056
1.1%
SMALLEST VALUES

0.0
52,103
52.1%
0.006756756756756757
1
<0.1%
0.007407407407407408
1
<0.1%
0.007462686567164179
1
<0.1%
0.007633587786259542
1
<0.1%
0.007874015748031496
1
<0.1%
0.007936507936507936
1
<0.1%
0.008
1
<0.1%
0.00819672131147541
1
<0.1%
0.008264462809917356
1
<0.1%
0.008333333333333333
2
<0.1%
0.008403361344537815
1
<0.1%
0.00847457627118644
1
<0.1%
0.008695652173913044
1
<0.1%
0.008771929824561403
1
<0.1%
LARGEST VALUES

1.0
1,178
1.2%
0.8
1
<0.1%
0.75
3
<0.1%
0.6666666666666666
113
0.1%
0.6
13
<0.1%
0.5714285714285714
4
<0.1%
0.5
2,378
2.4%
0.45454545454545453
1
<0.1%
0.4444444444444444
7
<0.1%
0.4375
1
<0.1%
0.42857142857142855
28
<0.1%
0.4166666666666667
1
<0.1%
0.4090909090909091
1
<0.1%
0.4
169
0.2%
0.3888888888888889
1
<0.1%
lang_pct_proper_noun_singular
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_noun_singular
-0.36
lang_pct_noun_plural
-0.20
lang_pct_determiner
-0.19
lang_pct_verb_base_form
-0.19
lang_pct_preposition_subordinating_conjunction
-0.19
lang_mean_words_per_sent
-0.16
lang_pct_adjective
-0.16
lang_pct_verb_sing_present_non_third_person
-0.14
lang_pct_to_infinitive_preposition
-0.14
lang_pct_verb_past_participle
-0.13
lang_pct_verb_3rd_person_sing_present
-0.13
lang_pct_possessive_pronoun
-0.12
lang_pct_punct
-0.11
lang_pct_modal
-0.11

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_alnum
0.26
target
0.22
lang_ls_fs
0.17
is_title
0.17
lang_ls_qm
0.16
begins_with
0.10
style_heading
0.10
is_bold
0.09
is_upper
0.08
form_font_family_mode_ind
0.06
lang_ls_clscl
0.06
style_toc
0.06
para_foll_size_ind
0.06
style_list_num
0.06
MOST FREQUENT VALUES

0.0
41,726
41.7%
1.0
4,453
4.5%
0.5
4,434
4.4%
0.3333333333333333
2,959
3.0%
0.25
2,542
2.5%
0.6666666666666666
2,138
2.1%
0.2
1,614
1.6%
0.16666666666666666
1,612
1.6%
0.14285714285714285
1,338
1.3%
0.125
1,285
1.3%
0.1111111111111111
1,175
1.2%
0.1
1,151
1.2%
0.09090909090909091
1,027
1.0%
0.75
1,000
1.0%
0.08333333333333333
951
1.0%
SMALLEST VALUES

0.0
41,726
41.7%
0.004291845493562232
1
<0.1%
0.005434782608695652
1
<0.1%
0.006329113924050633
1
<0.1%
0.006944444444444444
1
<0.1%
0.007067137809187279
1
<0.1%
0.007462686567164179
1
<0.1%
0.007518796992481203
2
<0.1%
0.007874015748031496
1
<0.1%
0.007936507936507936
1
<0.1%
0.008
2
<0.1%
0.00819672131147541
1
<0.1%
0.008403361344537815
4
<0.1%
0.008438818565400843
1
<0.1%
0.00847457627118644
2
<0.1%
LARGEST VALUES

1.0
4,453
4.5%
0.9861538461538462
1
<0.1%
0.9743589743589743
1
<0.1%
0.9444444444444444
1
<0.1%
0.9411764705882353
4
<0.1%
0.9375
1
<0.1%
0.9310344827586207
1
<0.1%
0.9285714285714286
3
<0.1%
0.9230769230769231
1
<0.1%
0.9180327868852459
1
<0.1%
0.9166666666666666
3
<0.1%
0.9090909090909091
6
<0.1%
0.9047619047619048
1
<0.1%
0.9
9
<0.1%
0.8888888888888888
15
<0.1%
lang_pct_proper_noun_plural
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_noun_singular
-0.05
lang_pct_proper_noun_singular
0.04
lang_pct_noun_plural
-0.02
lang_pct_coordinating_conjunction
0.02
form_rel_font_size
0.02
lang_pct_verb_base_form
-0.02
lang_pct_punct
-0.02
lang_pct_determiner
-0.02
lang_pct_adjective
-0.02
lang_pct_verb_3rd_person_sing_present
-0.02
lang_pct_verb_sing_present_non_third_person
0.01
lang_pct_wh_abverb
-0.01
lang_pct_possessive_pronoun
-0.01
lang_pct_adverb
-0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

target
0.04
is_upper
0.02
lang_ls_alnum
0.02
is_title
0.02
lang_ls_qm
0.02
para_foll_size_ind
0.02
style_heading
0.02
para_foll_depth_ind
0.02
form_font_family_mode_ind
0.02
style_cover_nm_add
0.01
para_prec_size_ind
0.01
style_title
0.01
para_prec_depth_ind
0.01
para_prec_font_ind
0.01
MOST FREQUENT VALUES

0.0
97,673
97.7%
0.2
106
0.1%
0.25
80
<0.1%
0.3333333333333333
60
<0.1%
0.14285714285714285
54
<0.1%
0.16666666666666666
54
<0.1%
0.125
47
<0.1%
0.06666666666666667
38
<0.1%
0.1111111111111111
37
<0.1%
0.08333333333333333
36
<0.1%
0.05555555555555555
33
<0.1%
0.058823529411764705
32
<0.1%
0.043478260869565216
32
<0.1%
0.0625
32
<0.1%
0.1
32
<0.1%
SMALLEST VALUES

0.0
97,673
97.7%
0.0002390628735357399
1
<0.1%
0.0002501250625312656
1
<0.1%
0.00026281208935611036
1
<0.1%
0.0002774694783573807
1
<0.1%
0.0002818489289740699
1
<0.1%
0.00037243947858472997
1
<0.1%
0.0004020908725371934
1
<0.1%
0.00041118421052631577
1
<0.1%
0.00041220115416323167
1
<0.1%
0.0004139072847682119
1
<0.1%
0.00042544139544777704
1
<0.1%
0.0004784688995215311
1
<0.1%
0.0005027652086475615
1
<0.1%
0.0005263157894736842
1
<0.1%
LARGEST VALUES

0.5
27
<0.1%
0.3333333333333333
60
<0.1%
0.2857142857142857
4
<0.1%
0.25
80
<0.1%
0.2222222222222222
1
<0.1%
0.2
106
0.1%
0.18181818181818182
1
<0.1%
0.17647058823529413
1
<0.1%
0.16666666666666666
54
<0.1%
0.15384615384615385
2
<0.1%
0.14285714285714285
54
<0.1%
0.13333333333333333
5
<0.1%
0.125
47
<0.1%
0.11764705882352941
2
<0.1%
0.1111111111111111
37
<0.1%
lang_pct_predeterminer
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_determiner
0.03
lang_mean_words_per_sent
0.02
lang_pct_preposition_subordinating_conjunction
0.02
lang_pct_adverb
0.02
lang_pct_verb_sing_present_non_third_person
0.02
lang_pct_noun_singular
-0.02
lang_pct_proper_noun_singular
-0.02
lang_pct_modal
0.01
lang_pct_verb_base_form
0.01
lang_pct_to_infinitive_preposition
0.01
lang_pct_possessive_wh_pronoun
0.01
lang_pct_verb_past_participle
0.01
lang_pct_personal_pronoun
0.01
lang_pct_punct
-0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_title
0.02
lang_ls_fs
0.02
lang_ls_alnum
0.01
para_prec_bold_ind
0.01
is_upper
0.01
lang_ls_brkt
0.01
para_foll_bold_ind
0.01
style_heading
0.01
para_foll_depth_ind
0.01
target
0.01
para_foll_colour_ind
0.01
para_prec_depth_ind
0.01
para_foll_size_ind
0.01
para_prec_size_ind
0.00
MOST FREQUENT VALUES

0.0
99,622
99.6%
0.058823529411764705
9
<0.1%
0.017241379310344827
8
<0.1%
0.034482758620689655
7
<0.1%
0.02702702702702703
7
<0.1%
0.08333333333333333
7
<0.1%
0.047619047619047616
7
<0.1%
0.09090909090909091
7
<0.1%
0.030303030303030304
6
<0.1%
0.02127659574468085
6
<0.1%
0.06666666666666667
6
<0.1%
0.038461538461538464
6
<0.1%
0.2
6
<0.1%
0.05
6
<0.1%
0.045454545454545456
6
<0.1%
SMALLEST VALUES

0.0
99,622
99.6%
6.70331143584931e-05
1
<0.1%
0.00013610997686130393
1
<0.1%
0.00013791201213625708
1
<0.1%
0.00013819789939192924
1
<0.1%
0.0001431229426077
1
<0.1%
0.0001439055979277594
1
<0.1%
0.0001518141794443601
1
<0.1%
0.00015508684863523573
1
<0.1%
0.00016155088852988692
1
<0.1%
0.00016883336147222692
1
<0.1%
0.00016986580601324953
1
<0.1%
0.00017061934823408976
1
<0.1%
0.00017292062943109114
1
<0.1%
0.00019327406262079628
1
<0.1%
LARGEST VALUES

0.25
1
<0.1%
0.2
6
<0.1%
0.16666666666666666
1
<0.1%
0.14285714285714285
2
<0.1%
0.125
2
<0.1%
0.1111111111111111
2
<0.1%
0.10526315789473684
1
<0.1%
0.1
1
<0.1%
0.09090909090909091
7
<0.1%
0.08333333333333333
7
<0.1%
0.07692307692307693
3
<0.1%
0.07142857142857142
1
<0.1%
0.06666666666666667
6
<0.1%
0.0625
3
<0.1%
0.058823529411764705
9
<0.1%
lang_pct_possessive_ending
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_mean_words_per_sent
0.03
lang_pct_noun_singular
-0.02
lang_pct_cardinal_digit
-0.01
lang_pct_possessive_pronoun
0.01
lang_pct_punct
-0.01
lang_pct_adjective
-0.01
lang_pct_to_infinitive_preposition
0.01
lang_pct_interjection
-0.01
target_encoded
0.01
lang_pct_modal
0.01
lang_pct_verb_past_tense
-0.01
lang_pct_preposition_subordinating_conjunction
0.00
lang_pct_personal_pronoun
-0.00
customer_pk
0.00

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_title
0.04
lang_ls_fs
0.02
target
0.02
lang_ls_alnum
0.02
is_upper
0.01
style_q
0.01
style_ans
0.01
begins_with
0.01
lang_ls_brkt
0.01
lang_ls_qm
0.01
style_toc
0.01
style_bullet
0.01
style_list_num
0.01
form_font_colour_mode_ind
0.01
MOST FREQUENT VALUES

0.0
98,576
98.6%
0.05
34
<0.1%
0.07142857142857142
34
<0.1%
0.09090909090909091
31
<0.1%
0.047619047619047616
30
<0.1%
0.08333333333333333
29
<0.1%
0.1
29
<0.1%
0.058823529411764705
27
<0.1%
0.05555555555555555
27
<0.1%
0.043478260869565216
24
<0.1%
0.041666666666666664
24
<0.1%
0.037037037037037035
23
<0.1%
0.04
23
<0.1%
0.07692307692307693
22
<0.1%
0.05263157894736842
22
<0.1%
SMALLEST VALUES

0.0
98,576
98.6%
0.00010913456291607552
1
<0.1%
0.0001122334455667789
1
<0.1%
0.00011903344839900012
1
<0.1%
0.00011910433539780847
1
<0.1%
0.00011964584828906437
1
<0.1%
0.0001644195988161789
1
<0.1%
0.00016645859342488556
1
<0.1%
0.00016655562958027982
1
<0.1%
0.00017259233690024162
1
<0.1%
0.0001741098633237573
2
<0.1%
0.00017491691446562882
1
<0.1%
0.0002187705097352877
1
<0.1%
0.00022232103156958648
1
<0.1%
0.00022737608003638017
1
<0.1%
LARGEST VALUES

0.6666666666666666
1
<0.1%
0.5
5
<0.1%
0.3333333333333333
16
<0.1%
0.3
1
<0.1%
0.25
12
<0.1%
0.2
9
<0.1%
0.17647058823529413
1
<0.1%
0.16666666666666666
10
<0.1%
0.15384615384615385
1
<0.1%
0.14285714285714285
15
<0.1%
0.13333333333333333
1
<0.1%
0.13043478260869565
1
<0.1%
0.125
19
<0.1%
0.11764705882352941
1
<0.1%
0.1111111111111111
15
<0.1%
lang_pct_personal_pronoun
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_verb_sing_present_non_third_person
0.25
lang_pct_verb_base_form
0.23
lang_pct_wh_abverb
0.16
lang_pct_proper_noun_singular
-0.11
lang_pct_noun_singular
-0.10
lang_mean_words_per_sent
0.09
lang_pct_modal
0.09
lang_pct_wh_pronoun
0.09
target_encoded
0.07
lang_pct_to_infinitive_preposition
0.06
lang_pct_possessive_pronoun
0.06
lang_pct_preposition_subordinating_conjunction
0.06
lang_pct_cardinal_digit
-0.05
lang_pct_wh_determiner
0.03

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_qm
0.21
target
0.15
lang_ls_alnum
0.15
is_title
0.11
begins_with
0.04
is_bold
0.04
lang_ls_fs
0.04
style_list_num
0.03
lang_ls_brkt
0.03
form_font_family_mode_ind
0.03
style_heading
0.03
is_upper
0.03
style_table
0.02
is_underline
0.02
MOST FREQUENT VALUES

0.0
88,414
88.4%
0.07692307692307693
362
0.4%
0.05555555555555555
341
0.3%
0.07142857142857142
337
0.3%
0.1
332
0.3%
0.06666666666666667
331
0.3%
0.058823529411764705
322
0.3%
0.0625
321
0.3%
0.09090909090909091
320
0.3%
0.08333333333333333
319
0.3%
0.1111111111111111
305
0.3%
0.05
276
0.3%
0.047619047619047616
261
0.3%
0.05263157894736842
261
0.3%
0.045454545454545456
248
0.2%
SMALLEST VALUES

0.0
88,414
88.4%
0.00045558086560364467
2
<0.1%
0.0006849315068493151
1
<0.1%
0.0007656967840735069
1
<0.1%
0.0008084074373484236
1
<0.1%
0.0008103727714748784
1
<0.1%
0.0008613264427217916
1
<0.1%
0.0010271460014673515
2
<0.1%
0.0010672358591248667
1
<0.1%
0.0010775862068965517
1
<0.1%
0.0011318619128466328
1
<0.1%
0.001146131805157593
1
<0.1%
0.001183431952662722
1
<0.1%
0.0012012012012012011
1
<0.1%
0.0012437810945273632
1
<0.1%
LARGEST VALUES

1.0
21
<0.1%
0.6
1
<0.1%
0.5
18
<0.1%
0.4
3
<0.1%
0.3333333333333333
19
<0.1%
0.2857142857142857
1
<0.1%
0.25
51
<0.1%
0.2222222222222222
5
<0.1%
0.21428571428571427
1
<0.1%
0.2
61
<0.1%
0.1875
5
<0.1%
0.18518518518518517
1
<0.1%
0.18181818181818182
12
<0.1%
0.17647058823529413
3
<0.1%
0.17073170731707318
2
<0.1%
lang_pct_possessive_pronoun
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_wh_pronoun
0.13
lang_pct_verb_3rd_person_sing_present
0.13
lang_pct_verb_base_form
0.13
target_encoded
0.12
lang_pct_proper_noun_singular
-0.12
lang_mean_words_per_sent
0.11
lang_pct_wh_abverb
0.08
lang_pct_preposition_subordinating_conjunction
0.08
lang_pct_verb_sing_present_non_third_person
0.08
lang_pct_cardinal_digit
-0.06
lang_pct_to_infinitive_preposition
0.06
lang_pct_personal_pronoun
0.06
lang_pct_noun_singular
-0.05
form_rel_font_size
-0.04

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

target
0.22
lang_ls_alnum
0.19
lang_ls_qm
0.17
is_title
0.15
lang_ls_fs
0.12
begins_with
0.09
is_upper
0.08
style_q
0.07
style_list_num
0.06
is_bold
0.05
style_table
0.05
lang_ls_brkt
0.04
form_font_family_mode_ind
0.04
form_font_colour_mode_ind
0.03
MOST FREQUENT VALUES

0.0
86,301
86.3%
0.07692307692307693
472
0.5%
0.07142857142857142
443
0.4%
0.1
434
0.4%
0.08333333333333333
427
0.4%
0.09090909090909091
403
0.4%
0.0625
387
0.4%
0.06666666666666667
385
0.4%
0.05555555555555555
369
0.4%
0.058823529411764705
361
0.4%
0.05
347
0.3%
0.05263157894736842
344
0.3%
0.1111111111111111
335
0.3%
0.125
310
0.3%
0.047619047619047616
302
0.3%
SMALLEST VALUES

0.0
86,301
86.3%
0.0004784688995215311
1
<0.1%
0.0006053268765133172
1
<0.1%
0.0006365372374283895
1
<0.1%
0.0007048872180451127
1
<0.1%
0.0007092198581560284
1
<0.1%
0.0007336757153338225
2
<0.1%
0.0008613264427217916
1
<0.1%
0.0009111617312072893
2
<0.1%
0.000970873786407767
1
<0.1%
0.0011025358324145535
1
<0.1%
0.0011560693641618498
1
<0.1%
0.0013245033112582781
1
<0.1%
0.0013568521031207597
1
<0.1%
0.0013698630136986301
1
<0.1%
LARGEST VALUES

1.0
1
<0.1%
0.5
40
<0.1%
0.3333333333333333
33
<0.1%
0.2857142857142857
2
<0.1%
0.25
53
<0.1%
0.23076923076923078
2
<0.1%
0.2222222222222222
7
<0.1%
0.2
90
<0.1%
0.1875
6
<0.1%
0.18518518518518517
1
<0.1%
0.18181818181818182
25
<0.1%
0.17647058823529413
2
<0.1%
0.17391304347826086
1
<0.1%
0.16666666666666666
197
0.2%
0.16129032258064516
1
<0.1%
lang_pct_adverb
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_noun_singular
-0.11
lang_pct_verb_past_participle
0.08
lang_pct_proper_noun_singular
-0.08
lang_pct_verb_base_form
0.08
lang_mean_words_per_sent
0.06
lang_pct_to_infinitive_preposition
0.05
lang_pct_wh_abverb
0.05
lang_pct_verb_3rd_person_sing_present
0.05
lang_pct_modal
0.04
lang_pct_verb_sing_present_non_third_person
0.04
lang_pct_noun_plural
-0.04
lang_pct_personal_pronoun
0.03
target_encoded
-0.03
lang_pct_cardinal_digit
-0.03

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_fs
0.07
lang_ls_alnum
0.06
is_upper
0.06
is_title
0.06
is_bold
0.05
target
0.04
style_heading
0.04
is_italic
0.03
begins_with
0.03
lang_ls_qm
0.03
lang_ls_clscl
0.02
style_toc
0.02
para_prec_depth_ind
0.01
para_foll_font_ind
0.01
MOST FREQUENT VALUES

0.0
83,297
83.3%
0.07692307692307693
412
0.4%
0.08333333333333333
391
0.4%
0.125
383
0.4%
0.05555555555555555
382
0.4%
0.07142857142857142
377
0.4%
0.06666666666666667
377
0.4%
0.058823529411764705
366
0.4%
0.09090909090909091
366
0.4%
0.0625
358
0.4%
0.05263157894736842
336
0.3%
0.05
334
0.3%
0.045454545454545456
328
0.3%
0.5
327
0.3%
0.043478260869565216
324
0.3%
SMALLEST VALUES

0.0
83,297
83.3%
0.0018018018018018018
1
<0.1%
0.002570694087403599
1
<0.1%
0.002777777777777778
1
<0.1%
0.002789400278940028
1
<0.1%
0.002898550724637681
1
<0.1%
0.002967359050445104
1
<0.1%
0.0030959752321981426
1
<0.1%
0.0035587188612099642
1
<0.1%
0.003703703703703704
1
<0.1%
0.003787878787878788
1
<0.1%
0.003816793893129771
1
<0.1%
0.0038910505836575876
1
<0.1%
0.004016064257028112
1
<0.1%
0.004073319755600814
1
<0.1%
LARGEST VALUES

1.0
50
<0.1%
0.6666666666666666
1
<0.1%
0.6
1
<0.1%
0.5
327
0.3%
0.4
6
<0.1%
0.375
1
<0.1%
0.3333333333333333
189
0.2%
0.3
1
<0.1%
0.2857142857142857
14
<0.1%
0.2727272727272727
3
<0.1%
0.25
179
0.2%
0.23076923076923078
4
<0.1%
0.22727272727272727
1
<0.1%
0.2222222222222222
13
<0.1%
0.21428571428571427
2
<0.1%
lang_pct_adverb_comparative
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_verb_gerund_present_participle
0.02
lang_mean_words_per_sent
0.02
lang_pct_to_infinitive_preposition
0.02
lang_pct_proper_noun_singular
-0.01
lang_pct_adverb
0.01
lang_pct_noun_singular
-0.01
lang_pct_verb_3rd_person_sing_present
0.01
lang_pct_preposition_subordinating_conjunction
0.01
lang_pct_verb_base_form
0.01
target_encoded
-0.01
lang_pct_wh_determiner
0.01
lang_pct_cardinal_digit
-0.01
lang_pct_personal_pronoun
0.01
lang_pct_verb_past_tense
0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_fs
0.02
is_title
0.02
lang_ls_alnum
0.01
para_foll_colour_ind
0.01
para_prec_depth_ind
0.01
style_bullet
0.01
is_upper
0.01
target
0.01
begins_with
0.01
is_underline
0.01
style_list_num
0.01
is_bold
0.01
style_table
0.01
style_heading
0.01
MOST FREQUENT VALUES

0.0
99,505
99.5%
0.03225806451612903
12
<0.1%
0.03571428571428571
10
<0.1%
0.034482758620689655
9
<0.1%
0.045454545454545456
8
<0.1%
0.09090909090909091
8
<0.1%
0.022727272727272728
8
<0.1%
0.07692307692307693
8
<0.1%
0.024390243902439025
8
<0.1%
0.029411764705882353
7
<0.1%
0.01818181818181818
7
<0.1%
0.02040816326530612
6
<0.1%
0.07142857142857142
6
<0.1%
0.0125
5
<0.1%
0.0196078431372549
5
<0.1%
SMALLEST VALUES

0.0
99,505
99.5%
7.284912945290303e-05
1
<0.1%
8.493290300662476e-05
1
<0.1%
0.00011368804001819008
1
<0.1%
0.00013607293509320997
1
<0.1%
0.00016750418760469013
1
<0.1%
0.00017259233690024162
1
<0.1%
0.00018793459875963165
1
<0.1%
0.00018821757952192734
1
<0.1%
0.00021272069772388852
1
<0.1%
0.0002187705097352877
1
<0.1%
0.0002304147465437788
1
<0.1%
0.00023342670401493932
1
<0.1%
0.00023820867079561695
1
<0.1%
0.0002390628735357399
2
<0.1%
LARGEST VALUES

0.5
2
<0.1%
0.25
2
<0.1%
0.2
3
<0.1%
0.16666666666666666
1
<0.1%
0.14285714285714285
3
<0.1%
0.1111111111111111
2
<0.1%
0.1
1
<0.1%
0.09090909090909091
8
<0.1%
0.08333333333333333
5
<0.1%
0.07692307692307693
8
<0.1%
0.07142857142857142
6
<0.1%
0.06666666666666667
5
<0.1%
0.0625
5
<0.1%
0.058823529411764705
3
<0.1%
0.05555555555555555
4
<0.1%
lang_pct_adverb_superlative
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_mean_words_per_sent
0.02
lang_pct_adjective
0.02
lang_pct_proper_noun_singular
-0.02
lang_pct_possessive_pronoun
0.02
target_encoded
0.01
lang_pct_noun_singular
-0.01
lang_pct_verb_base_form
0.01
lang_pct_determiner
0.01
lang_pct_personal_pronoun
0.01
lang_pct_preposition_subordinating_conjunction
0.01
lang_pct_wh_determiner
0.01
lang_pct_wh_pronoun
0.01
lang_pct_verb_sing_present_non_third_person
0.01
lang_pct_possessive_wh_pronoun
0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_title
0.03
target
0.03
lang_ls_alnum
0.02
lang_ls_fs
0.02
begins_with
0.02
is_upper
0.01
para_foll_bold_ind
0.01
style_list_num
0.01
style_q
0.01
style_table
0.01
para_foll_size_ind
0.01
style_indent
0.01
form_font_colour_mode_ind
0.01
para_prec_size_ind
0.01
MOST FREQUENT VALUES

0.0
99,412
99.4%
0.06666666666666667
13
<0.1%
0.05555555555555555
13
<0.1%
0.0625
11
<0.1%
0.05
11
<0.1%
0.027777777777777776
11
<0.1%
0.058823529411764705
11
<0.1%
0.03225806451612903
10
<0.1%
0.038461538461538464
10
<0.1%
0.07692307692307693
10
<0.1%
0.04
10
<0.1%
0.03333333333333333
9
<0.1%
0.07142857142857142
9
<0.1%
0.037037037037037035
9
<0.1%
0.05263157894736842
9
<0.1%
SMALLEST VALUES

0.0
99,412
99.4%
8.493290300662476e-05
1
<0.1%
0.00011368804001819008
1
<0.1%
0.00016750418760469013
1
<0.1%
0.00018793459875963165
1
<0.1%
0.00019327406262079628
1
<0.1%
0.00019409937888198756
1
<0.1%
0.000194325689856199
1
<0.1%
0.00019447685725398678
1
<0.1%
0.0001945525291828794
1
<0.1%
0.0002010993430754793
1
<0.1%
0.0002185473883587091
1
<0.1%
0.00022232103156958648
1
<0.1%
0.00023496240601503758
1
<0.1%
0.00023557126030624264
1
<0.1%
LARGEST VALUES

0.3333333333333333
2
<0.1%
0.25
2
<0.1%
0.2
1
<0.1%
0.16666666666666666
4
<0.1%
0.14285714285714285
1
<0.1%
0.125
8
<0.1%
0.1111111111111111
5
<0.1%
0.1
6
<0.1%
0.09523809523809523
1
<0.1%
0.09090909090909091
6
<0.1%
0.08333333333333333
8
<0.1%
0.07692307692307693
10
<0.1%
0.07142857142857142
9
<0.1%
0.06666666666666667
13
<0.1%
0.06451612903225806
1
<0.1%
lang_pct_particle
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_verb_base_form
0.04
lang_mean_words_per_sent
0.03
lang_pct_noun_singular
-0.02
lang_pct_proper_noun_singular
-0.02
lang_pct_to_infinitive_preposition
0.02
lang_pct_wh_abverb
0.02
lang_pct_modal
0.02
lang_pct_verb_past_participle
0.02
target_encoded
0.01
lang_pct_verb_sing_present_non_third_person
0.01
lang_pct_cardinal_digit
-0.01
lang_pct_preposition_subordinating_conjunction
0.01
lang_pct_wh_determiner
0.01
lang_pct_personal_pronoun
0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_title
0.03
target
0.02
lang_ls_alnum
0.02
lang_ls_qm
0.02
lang_ls_fs
0.02
para_foll_colour_ind
0.01
is_upper
0.01
para_prec_size_ind
0.01
style_toc
0.01
is_bold
0.01
lang_ls_brkt
0.01
para_foll_italic_ind
0.01
para_prec_italic_ind
0.01
para_prec_font_ind
0.01
MOST FREQUENT VALUES

0.0
98,638
98.6%
0.06666666666666667
38
<0.1%
0.041666666666666664
33
<0.1%
0.09090909090909091
29
<0.1%
0.05263157894736842
29
<0.1%
0.05
29
<0.1%
0.0625
25
<0.1%
0.03571428571428571
25
<0.1%
0.07692307692307693
25
<0.1%
0.043478260869565216
25
<0.1%
0.03333333333333333
24
<0.1%
0.030303030303030304
24
<0.1%
0.05555555555555555
23
<0.1%
0.045454545454545456
23
<0.1%
0.058823529411764705
22
<0.1%
SMALLEST VALUES

0.0
98,638
98.6%
0.00014569825890580607
1
<0.1%
0.00017259233690024162
1
<0.1%
0.00018793459875963165
1
<0.1%
0.00020968756552736424
1
<0.1%
0.00021150592216582064
1
<0.1%
0.0002187705097352877
1
<0.1%
0.00022737608003638017
1
<0.1%
0.0002304147465437788
1
<0.1%
0.00023342670401493932
1
<0.1%
0.00023820867079561695
1
<0.1%
0.0002390628735357399
1
<0.1%
0.00024789291026276647
1
<0.1%
0.0002591344908007256
1
<0.1%
0.0002642007926023778
1
<0.1%
LARGEST VALUES

0.5
7
<0.1%
0.3333333333333333
7
<0.1%
0.25
7
<0.1%
0.2
6
<0.1%
0.16666666666666666
9
<0.1%
0.14285714285714285
12
<0.1%
0.13333333333333333
3
<0.1%
0.125
12
<0.1%
0.1111111111111111
9
<0.1%
0.10344827586206896
1
<0.1%
0.1
11
<0.1%
0.09090909090909091
29
<0.1%
0.08333333333333333
17
<0.1%
0.08064516129032258
1
<0.1%
0.08
1
<0.1%
lang_pct_to_infinitive_preposition
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_verb_base_form
0.38
lang_mean_words_per_sent
0.24
lang_pct_proper_noun_singular
-0.14
lang_pct_noun_singular
-0.12
lang_pct_modal
0.09
lang_pct_determiner
0.07
lang_pct_cardinal_digit
-0.07
lang_pct_verb_past_participle
0.07
lang_pct_personal_pronoun
0.06
lang_pct_possessive_pronoun
0.06
lang_pct_verb_sing_present_non_third_person
0.06
lang_pct_verb_3rd_person_sing_present
0.06
lang_pct_wh_determiner
0.06
lang_pct_preposition_subordinating_conjunction
0.05

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_title
0.24
lang_ls_fs
0.19
lang_ls_alnum
0.18
target
0.13
is_upper
0.11
is_bold
0.08
lang_ls_qm
0.07
style_list_num
0.05
style_heading
0.04
lang_ls_brkt
0.04
form_font_colour_mode_ind
0.03
style_toc
0.03
para_prec_size_ind
0.03
para_foll_depth_ind
0.02
MOST FREQUENT VALUES

0.0
77,370
77.4%
0.07142857142857142
699
0.7%
0.07692307692307693
682
0.7%
0.06666666666666667
679
0.7%
0.0625
665
0.7%
0.08333333333333333
649
0.6%
0.058823529411764705
648
0.6%
0.09090909090909091
648
0.6%
0.05555555555555555
639
0.6%
0.047619047619047616
572
0.6%
0.05263157894736842
568
0.6%
0.1
541
0.5%
0.125
531
0.5%
0.1111111111111111
519
0.5%
0.045454545454545456
514
0.5%
SMALLEST VALUES

0.0
77,370
77.4%
0.0022026431718061676
1
<0.1%
0.002205071664829107
1
<0.1%
0.002890173410404624
1
<0.1%
0.003278688524590164
1
<0.1%
0.0036101083032490976
1
<0.1%
0.003787878787878788
1
<0.1%
0.0038314176245210726
1
<0.1%
0.004329004329004329
1
<0.1%
0.004424778761061947
1
<0.1%
0.0045662100456621
1
<0.1%
0.004651162790697674
1
<0.1%
0.0047169811320754715
1
<0.1%
0.004784688995215311
1
<0.1%
0.004807692307692308
1
<0.1%
LARGEST VALUES

1.0
1
<0.1%
0.5
9
<0.1%
0.4
1
<0.1%
0.3333333333333333
70
<0.1%
0.2857142857142857
7
<0.1%
0.2727272727272727
1
<0.1%
0.25
164
0.2%
0.23076923076923078
2
<0.1%
0.2222222222222222
17
<0.1%
0.21428571428571427
5
<0.1%
0.2
224
0.2%
0.1875
1
<0.1%
0.18181818181818182
31
<0.1%
0.17857142857142858
1
<0.1%
0.17777777777777778
1
<0.1%
lang_pct_interjection
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_noun_singular
-0.09
lang_mean_words_per_sent
-0.08
lang_pct_preposition_subordinating_conjunction
-0.08
lang_pct_punct
-0.07
target_encoded
-0.07
lang_pct_proper_noun_singular
-0.07
lang_pct_noun_plural
-0.06
lang_pct_adjective
-0.05
lang_pct_verb_base_form
-0.05
lang_pct_coordinating_conjunction
-0.05
lang_pct_to_infinitive_preposition
-0.04
lang_pct_verb_past_participle
-0.04
lang_pct_verb_sing_present_non_third_person
-0.03
lang_pct_modal
-0.03

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_title
0.17
lang_ls_alnum
0.10
target
0.08
lang_ls_fs
0.06
begins_with
0.06
lang_ls_qm
0.04
para_prec_font_ind
0.03
lang_ls_clscl
0.03
style_list_num
0.03
is_bold
0.03
para_foll_depth_ind
0.02
style_heading
0.02
lang_ls_brkt
0.02
form_font_family_mode_ind
0.02
MOST FREQUENT VALUES

0.0
97,319
97.3%
1.0
706
0.7%
0.5
330
0.3%
0.3333333333333333
175
0.2%
0.06666666666666667
83
<0.1%
0.0625
55
<0.1%
0.25
50
<0.1%
0.041666666666666664
44
<0.1%
0.2
43
<0.1%
0.045454545454545456
43
<0.1%
0.058823529411764705
43
<0.1%
0.05263157894736842
39
<0.1%
0.16666666666666666
38
<0.1%
0.03333333333333333
36
<0.1%
0.038461538461538464
35
<0.1%
SMALLEST VALUES

0.0
97,319
97.3%
0.0001467351430667645
2
<0.1%
0.00022232103156958648
1
<0.1%
0.00022742779167614282
1
<0.1%
0.00023282887077997672
1
<0.1%
0.00027214587018641994
1
<0.1%
0.00027221995372260786
1
<0.1%
0.00027582402427251415
1
<0.1%
0.00029708853238265005
1
<0.1%
0.0003036283588887202
1
<0.1%
0.00031938677738741617
1
<0.1%
0.0003303600925008259
1
<0.1%
0.00033766672294445384
1
<0.1%
0.0003412386964681795
1
<0.1%
0.0003448275862068965
1
<0.1%
LARGEST VALUES

1.0
706
0.7%
0.5
330
0.3%
0.3333333333333333
175
0.2%
0.25
50
<0.1%
0.2
43
<0.1%
0.18181818181818182
1
<0.1%
0.16666666666666666
38
<0.1%
0.14285714285714285
18
<0.1%
0.125
35
<0.1%
0.1111111111111111
19
<0.1%
0.10526315789473684
1
<0.1%
0.1
28
<0.1%
0.09090909090909091
20
<0.1%
0.08695652173913043
1
<0.1%
0.08333333333333333
21
<0.1%
lang_pct_verb_base_form
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_to_infinitive_preposition
0.38
lang_pct_modal
0.32
lang_pct_personal_pronoun
0.23
lang_mean_words_per_sent
0.21
lang_pct_proper_noun_singular
-0.19
lang_pct_noun_singular
-0.15
lang_pct_possessive_pronoun
0.13
lang_pct_wh_abverb
0.11
lang_pct_cardinal_digit
-0.10
lang_pct_preposition_subordinating_conjunction
0.10
target_encoded
0.10
lang_pct_wh_determiner
0.08
lang_pct_adverb
0.08
lang_pct_determiner
0.07

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_alnum
0.26
is_title
0.25
target
0.23
lang_ls_fs
0.22
lang_ls_qm
0.14
is_upper
0.12
is_bold
0.09
style_list_num
0.06
style_heading
0.05
style_toc
0.05
form_font_colour_mode_ind
0.05
begins_with
0.04
style_table
0.04
style_q
0.04
MOST FREQUENT VALUES

0.0
68,228
68.2%
0.09090909090909091
1,154
1.2%
0.08333333333333333
1,084
1.1%
0.1
1,063
1.1%
0.07692307692307693
1,062
1.1%
0.1111111111111111
1,046
1.0%
0.07142857142857142
1,010
1.0%
0.125
954
1.0%
0.06666666666666667
931
0.9%
0.0625
846
0.8%
0.058823529411764705
794
0.8%
0.14285714285714285
786
0.8%
0.05555555555555555
758
0.8%
0.16666666666666666
729
0.7%
0.05263157894736842
692
0.7%
SMALLEST VALUES

0.0
68,228
68.2%
0.003105590062111801
1
<0.1%
0.0032
1
<0.1%
0.0044444444444444444
1
<0.1%
0.004464285714285714
1
<0.1%
0.0045662100456621
2
<0.1%
0.004807692307692308
1
<0.1%
0.0049504950495049506
1
<0.1%
0.005
1
<0.1%
0.005208333333333333
1
<0.1%
0.005291005291005291
1
<0.1%
0.005434782608695652
1
<0.1%
0.005988023952095809
1
<0.1%
0.006097560975609756
1
<0.1%
0.006134969325153374
1
<0.1%
LARGEST VALUES

1.0
59
<0.1%
0.6666666666666666
2
<0.1%
0.5
49
<0.1%
0.42857142857142855
3
<0.1%
0.4
13
<0.1%
0.375
5
<0.1%
0.3333333333333333
191
0.2%
0.3125
1
<0.1%
0.3076923076923077
5
<0.1%
0.30434782608695654
1
<0.1%
0.3
10
<0.1%
0.29411764705882354
1
<0.1%
0.2857142857142857
109
0.1%
0.2727272727272727
19
<0.1%
0.26666666666666666
6
<0.1%
lang_pct_verb_past_tense
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_noun_singular
-0.07
lang_pct_wh_abverb
0.05
lang_pct_verb_base_form
-0.03
lang_pct_adjective
-0.02
lang_pct_punct
-0.02
lang_mean_words_per_sent
0.02
lang_pct_modal
-0.02
lang_pct_interjection
-0.02
lang_pct_cardinal_digit
-0.02
lang_pct_proper_noun_singular
-0.02
form_rel_font_size
-0.01
lang_pct_verb_gerund_present_participle
-0.01
lang_pct_verb_3rd_person_sing_present
0.01
lang_pct_verb_sing_present_non_third_person
-0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_qm
0.05
is_title
0.05
is_upper
0.05
target
0.04
style_ans
0.02
lang_ls_brkt
0.02
style_heading
0.02
is_bold
0.02
para_foll_size_ind
0.02
lang_ls_alnum
0.02
para_foll_depth_ind
0.01
para_prec_size_ind
0.01
para_prec_colour_ind
0.01
style_list_num
0.01
MOST FREQUENT VALUES

0.0
90,964
91.0%
0.09090909090909091
229
0.2%
0.058823529411764705
215
0.2%
0.07692307692307693
214
0.2%
0.07142857142857142
206
0.2%
0.1
199
0.2%
0.08333333333333333
198
0.2%
0.1111111111111111
194
0.2%
0.0625
193
0.2%
0.06666666666666667
192
0.2%
0.14285714285714285
174
0.2%
0.05555555555555555
170
0.2%
0.05263157894736842
169
0.2%
0.047619047619047616
169
0.2%
0.16666666666666666
168
0.2%
SMALLEST VALUES

0.0
90,964
91.0%
0.000970873786407767
1
<0.1%
0.001004016064257028
1
<0.1%
0.0014097744360902255
1
<0.1%
0.0015060240963855422
1
<0.1%
0.001589825119236884
1
<0.1%
0.0016611295681063123
1
<0.1%
0.0017667844522968198
1
<0.1%
0.0018018018018018018
1
<0.1%
0.0018975332068311196
1
<0.1%
0.002028397565922921
1
<0.1%
0.0020491803278688526
3
<0.1%
0.0022026431718061676
1
<0.1%
0.002224199288256228
1
<0.1%
0.002254791431792559
1
<0.1%
LARGEST VALUES

1.0
11
<0.1%
0.5
144
0.1%
0.4
3
<0.1%
0.3333333333333333
156
0.2%
0.2857142857142857
11
<0.1%
0.25
129
0.1%
0.2222222222222222
12
<0.1%
0.2
165
0.2%
0.1875
1
<0.1%
0.18181818181818182
11
<0.1%
0.17647058823529413
1
<0.1%
0.16666666666666666
168
0.2%
0.15789473684210525
1
<0.1%
0.15384615384615385
6
<0.1%
0.15151515151515152
1
<0.1%
lang_pct_verb_gerund_present_participle
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_proper_noun_singular
-0.08
lang_pct_noun_singular
-0.07
lang_pct_punct
-0.05
target_encoded
0.05
lang_pct_adjective
-0.03
lang_pct_interjection
-0.02
lang_pct_cardinal_digit
-0.02
lang_pct_adverb_comparative
0.02
lang_pct_possessive_pronoun
0.02
lang_pct_determiner
-0.02
lang_mean_words_per_sent
0.02
lang_pct_verb_base_form
-0.02
lang_pct_verb_past_participle
-0.02
lang_pct_coordinating_conjunction
0.02

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_upper
0.06
target
0.05
style_heading
0.03
lang_ls_brkt
0.03
begins_with
0.02
style_list_num
0.02
style_bullet
0.02
is_title
0.02
lang_ls_fs
0.01
para_foll_bold_ind
0.01
para_prec_bold_ind
0.01
lang_ls_alnum
0.01
is_bold
0.01
lang_ls_clscl
0.01
MOST FREQUENT VALUES

0.0
83,477
83.5%
0.06666666666666667
424
0.4%
0.07692307692307693
406
0.4%
0.09090909090909091
396
0.4%
0.08333333333333333
395
0.4%
0.1
378
0.4%
0.058823529411764705
369
0.4%
0.05555555555555555
362
0.4%
0.0625
361
0.4%
0.07142857142857142
360
0.4%
0.1111111111111111
339
0.3%
0.05263157894736842
331
0.3%
0.05
327
0.3%
0.5
326
0.3%
0.047619047619047616
319
0.3%
SMALLEST VALUES

0.0
83,477
83.5%
0.0012706480304955528
1
<0.1%
0.0016611295681063123
1
<0.1%
0.0022026431718061676
1
<0.1%
0.002205071664829107
1
<0.1%
0.0023923444976076554
1
<0.1%
0.002652519893899204
1
<0.1%
0.002733485193621868
2
<0.1%
0.0028296547821165816
1
<0.1%
0.002849002849002849
1
<0.1%
0.0028735632183908046
1
<0.1%
0.002881844380403458
1
<0.1%
0.002890173410404624
1
<0.1%
0.002912621359223301
1
<0.1%
0.003115264797507788
1
<0.1%
LARGEST VALUES

1.0
211
0.2%
0.6666666666666666
10
<0.1%
0.5
326
0.3%
0.42857142857142855
1
<0.1%
0.4
10
<0.1%
0.375
1
<0.1%
0.3333333333333333
261
0.3%
0.3125
2
<0.1%
0.3
1
<0.1%
0.29411764705882354
1
<0.1%
0.2857142857142857
4
<0.1%
0.2727272727272727
2
<0.1%
0.2631578947368421
1
<0.1%
0.25
202
0.2%
0.23076923076923078
2
<0.1%
lang_pct_verb_past_participle
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_proper_noun_singular
-0.13
lang_pct_noun_singular
-0.12
lang_pct_preposition_subordinating_conjunction
0.11
lang_mean_words_per_sent
0.11
lang_pct_wh_abverb
0.09
lang_pct_modal
0.09
lang_pct_adverb
0.08
lang_pct_verb_3rd_person_sing_present
0.08
lang_pct_verb_sing_present_non_third_person
0.07
lang_pct_to_infinitive_preposition
0.07
lang_pct_cardinal_digit
-0.06
lang_pct_wh_determiner
0.06
lang_pct_verb_base_form
0.05
lang_pct_adjective
-0.04

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_alnum
0.12
is_title
0.11
lang_ls_qm
0.09
lang_ls_fs
0.09
is_upper
0.09
target
0.08
is_bold
0.04
style_heading
0.03
style_toc
0.03
style_list_num
0.03
lang_ls_brkt
0.02
para_prec_size_ind
0.02
is_underline
0.02
form_font_colour_mode_ind
0.01
MOST FREQUENT VALUES

0.0
75,799
75.8%
0.07692307692307693
738
0.7%
0.06666666666666667
719
0.7%
0.07142857142857142
714
0.7%
0.08333333333333333
710
0.7%
0.09090909090909091
690
0.7%
0.1
655
0.7%
0.0625
652
0.7%
0.1111111111111111
646
0.6%
0.058823529411764705
640
0.6%
0.05263157894736842
574
0.6%
0.125
568
0.6%
0.05555555555555555
553
0.6%
0.05
517
0.5%
0.14285714285714285
512
0.5%
SMALLEST VALUES

0.0
75,799
75.8%
0.002544529262086514
1
<0.1%
0.0035335689045936395
1
<0.1%
0.0040650406504065045
2
<0.1%
0.00425531914893617
1
<0.1%
0.004464285714285714
1
<0.1%
0.004694835680751174
1
<0.1%
0.004739336492890996
1
<0.1%
0.004807692307692308
1
<0.1%
0.0049261083743842365
1
<0.1%
0.005076142131979695
1
<0.1%
0.005263157894736842
2
<0.1%
0.005405405405405406
2
<0.1%
0.0056657223796034
1
<0.1%
0.005714285714285714
1
<0.1%
LARGEST VALUES

1.0
95
<0.1%
0.6666666666666666
1
<0.1%
0.5
235
0.2%
0.42857142857142855
1
<0.1%
0.4
7
<0.1%
0.375
1
<0.1%
0.3333333333333333
235
0.2%
0.3
6
<0.1%
0.2857142857142857
21
<0.1%
0.2727272727272727
9
<0.1%
0.26666666666666666
2
<0.1%
0.25
243
0.2%
0.23809523809523808
1
<0.1%
0.23529411764705882
2
<0.1%
0.23076923076923078
13
<0.1%
lang_pct_verb_sing_present_non_third_person
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_personal_pronoun
0.25
lang_pct_wh_pronoun
0.17
lang_pct_wh_abverb
0.15
lang_pct_wh_determiner
0.14
lang_pct_proper_noun_singular
-0.14
lang_pct_noun_singular
-0.13
lang_mean_words_per_sent
0.12
lang_pct_noun_plural
0.09
target_encoded
0.08
lang_pct_possessive_pronoun
0.08
lang_pct_verb_past_participle
0.07
lang_pct_to_infinitive_preposition
0.06
lang_pct_verb_base_form
0.06
lang_pct_cardinal_digit
-0.06

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_qm
0.23
lang_ls_alnum
0.18
target
0.18
is_title
0.15
is_upper
0.08
lang_ls_fs
0.07
is_bold
0.05
style_list_num
0.05
begins_with
0.04
style_heading
0.04
style_toc
0.03
lang_ls_brkt
0.03
style_table
0.03
form_font_family_mode_ind
0.03
MOST FREQUENT VALUES

0.0
82,718
82.7%
0.07692307692307693
557
0.6%
0.08333333333333333
537
0.5%
0.09090909090909091
522
0.5%
0.0625
520
0.5%
0.06666666666666667
508
0.5%
0.07142857142857142
485
0.5%
0.1
482
0.5%
0.05555555555555555
474
0.5%
0.058823529411764705
459
0.5%
0.1111111111111111
441
0.4%
0.045454545454545456
408
0.4%
0.047619047619047616
406
0.4%
0.05
389
0.4%
0.05263157894736842
387
0.4%
SMALLEST VALUES

0.0
82,718
82.7%
0.0016
1
<0.1%
0.001763668430335097
1
<0.1%
0.0018744142455482662
1
<0.1%
0.001953125
1
<0.1%
0.001968503937007874
1
<0.1%
0.002570694087403599
1
<0.1%
0.002898550724637681
1
<0.1%
0.002967359050445104
1
<0.1%
0.0029940119760479044
1
<0.1%
0.003105590062111801
1
<0.1%
0.003278688524590164
1
<0.1%
0.003355704697986577
1
<0.1%
0.0035587188612099642
1
<0.1%
0.003703703703703704
1
<0.1%
LARGEST VALUES

0.5
76
<0.1%
0.4
1
<0.1%
0.3333333333333333
60
<0.1%
0.3
1
<0.1%
0.2857142857142857
4
<0.1%
0.2727272727272727
1
<0.1%
0.25
86
<0.1%
0.23076923076923078
4
<0.1%
0.2222222222222222
12
<0.1%
0.21428571428571427
1
<0.1%
0.2127659574468085
1
<0.1%
0.2
186
0.2%
0.19047619047619047
1
<0.1%
0.1875
2
<0.1%
0.18181818181818182
23
<0.1%
lang_pct_verb_3rd_person_sing_present
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_wh_pronoun
0.23
lang_mean_words_per_sent
0.15
lang_pct_wh_abverb
0.14
lang_pct_possessive_pronoun
0.13
lang_pct_proper_noun_singular
-0.13
lang_pct_determiner
0.12
lang_pct_existential_there
0.11
lang_pct_wh_determiner
0.11
lang_pct_noun_singular
-0.09
target_encoded
0.08
lang_pct_verb_past_participle
0.08
lang_pct_preposition_subordinating_conjunction
0.07
lang_pct_noun_plural
-0.06
lang_pct_to_infinitive_preposition
0.06

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_qm
0.28
lang_ls_alnum
0.22
target
0.20
is_title
0.18
is_upper
0.09
is_bold
0.08
lang_ls_fs
0.08
begins_with
0.06
style_list_num
0.05
lang_ls_brkt
0.05
style_heading
0.05
style_table
0.04
form_font_colour_mode_ind
0.04
para_prec_size_ind
0.03
MOST FREQUENT VALUES

0.0
79,279
79.3%
0.08333333333333333
677
0.7%
0.07142857142857142
664
0.7%
0.07692307692307693
662
0.7%
0.09090909090909091
660
0.7%
0.1
645
0.6%
0.1111111111111111
632
0.6%
0.0625
614
0.6%
0.06666666666666667
606
0.6%
0.058823529411764705
549
0.5%
0.05555555555555555
521
0.5%
0.05263157894736842
490
0.5%
0.125
485
0.5%
0.05
479
0.5%
0.047619047619047616
468
0.5%
SMALLEST VALUES

0.0
79,279
79.3%
0.0011025358324145535
1
<0.1%
0.0015384615384615385
1
<0.1%
0.0021598272138228943
1
<0.1%
0.002325581395348837
1
<0.1%
0.0027397260273972603
1
<0.1%
0.002777777777777778
1
<0.1%
0.002779708130646282
1
<0.1%
0.003105590062111801
1
<0.1%
0.003278688524590164
1
<0.1%
0.0033333333333333335
1
<0.1%
0.0034453057708871662
1
<0.1%
0.003524229074889868
1
<0.1%
0.0035460992907801418
1
<0.1%
0.0035971223021582736
1
<0.1%
LARGEST VALUES

0.5
9
<0.1%
0.4
1
<0.1%
0.3333333333333333
167
0.2%
0.2857142857142857
7
<0.1%
0.25
132
0.1%
0.23076923076923078
1
<0.1%
0.2222222222222222
13
<0.1%
0.21428571428571427
1
<0.1%
0.2
191
0.2%
0.1875
1
<0.1%
0.18181818181818182
25
<0.1%
0.17647058823529413
3
<0.1%
0.17391304347826086
1
<0.1%
0.16666666666666666
298
0.3%
0.16
1
<0.1%
lang_pct_wh_determiner
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_mean_words_per_sent
0.18
lang_pct_verb_sing_present_non_third_person
0.14
lang_pct_verb_3rd_person_sing_present
0.11
lang_pct_modal
0.10
lang_pct_proper_noun_singular
-0.08
lang_pct_verb_base_form
0.08
lang_pct_preposition_subordinating_conjunction
0.07
lang_pct_noun_singular
-0.07
lang_pct_determiner
0.06
lang_pct_verb_past_participle
0.06
lang_pct_to_infinitive_preposition
0.06
lang_pct_cardinal_digit
-0.04
target_encoded
0.04
lang_pct_personal_pronoun
0.03

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_alnum
0.13
lang_ls_fs
0.13
is_title
0.12
target
0.10
is_upper
0.05
lang_ls_qm
0.04
is_bold
0.04
style_list_num
0.03
begins_with
0.03
style_toc
0.02
style_heading
0.02
form_font_colour_mode_ind
0.02
para_foll_depth_ind
0.02
lang_ls_brkt
0.02
MOST FREQUENT VALUES

0.0
93,978
94.0%
0.058823529411764705
156
0.2%
0.045454545454545456
142
0.1%
0.047619047619047616
139
0.1%
0.043478260869565216
136
0.1%
0.04
130
0.1%
0.05
128
0.1%
0.05263157894736842
119
0.1%
0.03571428571428571
119
0.1%
0.05555555555555555
115
0.1%
0.038461538461538464
115
0.1%
0.037037037037037035
111
0.1%
0.03125
111
0.1%
0.041666666666666664
110
0.1%
0.0625
110
0.1%
SMALLEST VALUES

0.0
93,978
94.0%
0.0005793742757821553
1
<0.1%
0.0007867820613690008
1
<0.1%
0.0008110300081103001
1
<0.1%
0.000864304235090752
1
<0.1%
0.0010080645161290322
1
<0.1%
0.0010775862068965517
1
<0.1%
0.0010845986984815619
2
<0.1%
0.0011655011655011655
2
<0.1%
0.001349527665317139
1
<0.1%
0.0013596193065941536
1
<0.1%
0.0013812154696132596
1
<0.1%
0.0016020506247997437
1
<0.1%
0.0016113438607798904
1
<0.1%
0.0017006802721088435
1
<0.1%
LARGEST VALUES

0.3333333333333333
1
<0.1%
0.25
1
<0.1%
0.2
9
<0.1%
0.16666666666666666
16
<0.1%
0.15384615384615385
1
<0.1%
0.14285714285714285
18
<0.1%
0.125
20
<0.1%
0.11764705882352941
1
<0.1%
0.1111111111111111
26
<0.1%
0.10526315789473684
1
<0.1%
0.1
44
<0.1%
0.09523809523809523
6
<0.1%
0.09090909090909091
71
<0.1%
0.08695652173913043
7
<0.1%
0.08333333333333333
93
<0.1%
lang_pct_wh_pronoun
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_verb_3rd_person_sing_present
0.23
lang_pct_verb_sing_present_non_third_person
0.17
lang_pct_possessive_pronoun
0.13
target_encoded
0.12
lang_pct_proper_noun_singular
-0.10
lang_pct_personal_pronoun
0.09
lang_pct_preposition_subordinating_conjunction
0.05
lang_pct_noun_singular
-0.05
form_rel_font_size
-0.05
lang_pct_cardinal_digit
-0.05
lang_pct_verb_base_form
0.04
lang_mean_words_per_sent
0.04
lang_pct_to_infinitive_preposition
0.03
lang_pct_verb_past_participle
0.03

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_qm
0.37
target
0.22
lang_ls_alnum
0.16
is_title
0.11
begins_with
0.10
style_list_num
0.07
is_upper
0.05
form_font_family_mode_ind
0.05
lang_ls_fs
0.05
lang_ls_clscl
0.04
style_q
0.03
form_font_colour_mode_ind
0.03
lang_ls_brkt
0.03
is_bold
0.03
MOST FREQUENT VALUES

0.0
94,390
94.4%
0.1
230
0.2%
0.09090909090909091
220
0.2%
0.07142857142857142
210
0.2%
0.1111111111111111
204
0.2%
0.08333333333333333
202
0.2%
0.05555555555555555
194
0.2%
0.07692307692307693
194
0.2%
0.06666666666666667
190
0.2%
0.058823529411764705
172
0.2%
0.0625
171
0.2%
0.125
162
0.2%
0.05
142
0.1%
0.045454545454545456
138
0.1%
0.043478260869565216
133
0.1%
SMALLEST VALUES

0.0
94,390
94.4%
0.0001467351430667645
2
<0.1%
0.0002671653753673524
1
<0.1%
0.000333000333000333
1
<0.1%
0.00047709923664122136
1
<0.1%
0.0004957858205255329
1
<0.1%
0.0005385029617662897
1
<0.1%
0.0005420054200542005
1
<0.1%
0.0006056935190793458
1
<0.1%
0.0006821282401091405
1
<0.1%
0.0007048872180451127
1
<0.1%
0.0007067137809187279
1
<0.1%
0.0007092198581560284
1
<0.1%
0.0007380073800738007
1
<0.1%
0.0007656967840735069
1
<0.1%
LARGEST VALUES

0.3333333333333333
1
<0.1%
0.2857142857142857
1
<0.1%
0.25
12
<0.1%
0.2
39
<0.1%
0.18181818181818182
4
<0.1%
0.16666666666666666
75
<0.1%
0.15384615384615385
5
<0.1%
0.14285714285714285
131
0.1%
0.13636363636363635
1
<0.1%
0.13333333333333333
10
<0.1%
0.13043478260869565
2
<0.1%
0.125
162
0.2%
0.11764705882352941
14
<0.1%
0.1111111111111111
204
0.2%
0.10526315789473684
12
<0.1%
lang_pct_possessive_wh_pronoun
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_mean_words_per_sent
0.04
lang_pct_to_infinitive_preposition
0.02
lang_pct_verb_3rd_person_sing_present
0.02
lang_pct_determiner
0.01
lang_pct_predeterminer
0.01
lang_pct_adjective_superlative
0.01
lang_pct_verb_sing_present_non_third_person
0.01
lang_pct_verb_base_form
0.01
lang_pct_proper_noun_singular
-0.01
lang_pct_adverb_superlative
0.01
lang_pct_modal
0.01
lang_pct_noun_singular
-0.01
lang_pct_preposition_subordinating_conjunction
0.01
lang_pct_verb_past_participle
0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_fs
0.02
lang_ls_alnum
0.02
is_title
0.01
para_foll_underline_ind
0.01
para_prec_font_ind
0.01
form_font_family_mode_ind
0.01
para_prec_depth_ind
0.01
is_upper
0.01
is_bold
0.01
lang_ls_brkt
0.01
target
0.01
para_foll_colour_ind
0.00
para_prec_size_ind
0.00
para_foll_size_ind
0.00
MOST FREQUENT VALUES

0.0
99,898
99.9%
0.029411764705882353
4
<0.1%
0.02040816326530612
3
<0.1%
0.02631578947368421
2
<0.1%
0.014084507042253521
2
<0.1%
0.01282051282051282
2
<0.1%
0.030303030303030304
2
<0.1%
0.017241379310344827
2
<0.1%
0.021739130434782608
2
<0.1%
0.05
2
<0.1%
0.03225806451612903
2
<0.1%
0.03333333333333333
2
<0.1%
0.043478260869565216
2
<0.1%
0.02
2
<0.1%
0.01639344262295082
2
<0.1%
SMALLEST VALUES

0.0
99,898
99.9%
0.0001439055979277594
1
<0.1%
0.00014569825890580607
1
<0.1%
0.0001774622892635315
1
<0.1%
0.00021150592216582064
1
<0.1%
0.00022737608003638017
1
<0.1%
0.0002304147465437788
1
<0.1%
0.00023342670401493932
1
<0.1%
0.00023820867079561695
1
<0.1%
0.0002646202699126753
1
<0.1%
0.0002671653753673524
1
<0.1%
0.00028328611898016995
1
<0.1%
0.00029850746268656717
1
<0.1%
0.0003204101249599487
1
<0.1%
0.0003222687721559781
1
<0.1%
LARGEST VALUES

0.07142857142857142
1
<0.1%
0.06666666666666667
1
<0.1%
0.0625
1
<0.1%
0.05970149253731343
1
<0.1%
0.05
2
<0.1%
0.047619047619047616
1
<0.1%
0.045454545454545456
1
<0.1%
0.043478260869565216
2
<0.1%
0.041666666666666664
1
<0.1%
0.034482758620689655
1
<0.1%
0.03333333333333333
2
<0.1%
0.03225806451612903
2
<0.1%
0.03125
1
<0.1%
0.030303030303030304
2
<0.1%
0.029411764705882353
4
<0.1%
lang_pct_wh_abverb
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_personal_pronoun
0.16
lang_pct_verb_sing_present_non_third_person
0.15
lang_pct_verb_3rd_person_sing_present
0.14
lang_pct_verb_base_form
0.11
target_encoded
0.11
lang_pct_proper_noun_singular
-0.10
lang_pct_verb_past_participle
0.09
lang_pct_possessive_pronoun
0.08
lang_pct_noun_singular
-0.08
lang_pct_modal
0.06
lang_mean_words_per_sent
0.06
lang_pct_verb_past_tense
0.05
lang_pct_cardinal_digit
-0.05
lang_pct_adverb
0.05

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_qm
0.27
target
0.20
lang_ls_alnum
0.15
is_title
0.12
begins_with
0.07
style_list_num
0.06
is_upper
0.06
style_q
0.04
form_font_family_mode_ind
0.04
is_bold
0.03
lang_ls_brkt
0.03
para_prec_size_ind
0.03
form_font_colour_mode_ind
0.03
para_prec_depth_ind
0.03
MOST FREQUENT VALUES

0.0
92,796
92.8%
0.07692307692307693
241
0.2%
0.09090909090909091
230
0.2%
0.08333333333333333
228
0.2%
0.1
227
0.2%
0.07142857142857142
221
0.2%
0.058823529411764705
219
0.2%
0.05555555555555555
209
0.2%
0.06666666666666667
206
0.2%
0.0625
206
0.2%
0.047619047619047616
185
0.2%
0.05
184
0.2%
0.05263157894736842
177
0.2%
0.125
177
0.2%
0.1111111111111111
176
0.2%
SMALLEST VALUES

0.0
92,796
92.8%
0.0002671653753673524
1
<0.1%
0.0003679175864606328
1
<0.1%
0.000390625
1
<0.1%
0.0004711425206124853
1
<0.1%
0.00047281323877068556
1
<0.1%
0.0005684402000909505
1
<0.1%
0.0005834305717619603
1
<0.1%
0.0006006006006006006
1
<0.1%
0.0006116207951070336
1
<0.1%
0.0006849315068493151
1
<0.1%
0.000741839762611276
1
<0.1%
0.0007432181345224824
1
<0.1%
0.0008149959250203749
1
<0.1%
0.0008613264427217916
1
<0.1%
LARGEST VALUES

1.0
1
<0.1%
0.5
3
<0.1%
0.3333333333333333
10
<0.1%
0.2857142857142857
2
<0.1%
0.25
20
<0.1%
0.2222222222222222
14
<0.1%
0.2
37
<0.1%
0.1875
1
<0.1%
0.18181818181818182
2
<0.1%
0.17647058823529413
1
<0.1%
0.16666666666666666
131
0.1%
0.15789473684210525
1
<0.1%
0.15384615384615385
14
<0.1%
0.15
2
<0.1%
0.14285714285714285
136
0.1%
lang_pct_punct
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_noun_singular
-0.19
lang_pct_proper_noun_singular
-0.11
lang_mean_words_per_sent
0.09
lang_pct_noun_plural
-0.08
lang_pct_interjection
-0.07
lang_pct_adjective
-0.06
lang_pct_coordinating_conjunction
-0.06
lang_pct_verb_gerund_present_participle
-0.05
lang_pct_to_infinitive_preposition
-0.05
lang_pct_determiner
-0.05
lang_pct_preposition_subordinating_conjunction
-0.05
lang_pct_foreign_word
0.04
lang_pct_wh_pronoun
0.03
form_rel_font_size
-0.02

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

lang_ls_alnum
0.53
lang_ls_brkt
0.38
lang_ls_clscl
0.35
lang_ls_fs
0.12
is_title
0.11
target
0.11
is_upper
0.10
style_heading
0.07
lang_ls_qm
0.07
style_toc
0.06
para_foll_bold_ind
0.04
para_foll_size_ind
0.03
para_prec_bold_ind
0.03
form_font_colour_mode_ind
0.03
MOST FREQUENT VALUES

0.0
39,655
39.7%
0.3333333333333333
2,989
3.0%
0.25
2,792
2.8%
0.5
2,761
2.8%
0.16666666666666666
2,540
2.5%
0.14285714285714285
2,450
2.5%
0.2
2,401
2.4%
0.125
2,350
2.4%
0.1111111111111111
2,202
2.2%
0.1
2,119
2.1%
0.09090909090909091
1,922
1.9%
0.08333333333333333
1,772
1.8%
0.07692307692307693
1,587
1.6%
0.07142857142857142
1,356
1.4%
0.06666666666666667
1,223
1.2%
SMALLEST VALUES

0.0
39,655
39.7%
0.006097560975609756
1
<0.1%
0.007692307692307693
1
<0.1%
0.011111111111111112
1
<0.1%
0.014285714285714285
1
<0.1%
0.015873015873015872
1
<0.1%
0.017857142857142856
1
<0.1%
0.01818181818181818
2
<0.1%
0.018518518518518517
2
<0.1%
0.018867924528301886
3
<0.1%
0.01904761904761905
1
<0.1%
0.019230769230769232
1
<0.1%
0.0196078431372549
2
<0.1%
0.02
4
<0.1%
0.02040816326530612
4
<0.1%
LARGEST VALUES

0.9696969696969697
1
<0.1%
0.96
1
<0.1%
0.9444444444444444
1
<0.1%
0.8888888888888888
2
<0.1%
0.875
1
<0.1%
0.8
2
<0.1%
0.75
24
<0.1%
0.7352941176470589
1
<0.1%
0.7142857142857143
1
<0.1%
0.6666666666666666
131
0.1%
0.6428571428571429
1
<0.1%
0.631578947368421
1
<0.1%
0.625
5
<0.1%
0.6153846153846154
1
<0.1%
0.6
68
<0.1%
lang_pct_sym
MISSING:
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

lang_pct_noun_singular
-0.02
lang_mean_words_per_sent
-0.02
lang_pct_preposition_subordinating_conjunction
-0.02
lang_pct_punct
-0.02
lang_pct_proper_noun_singular
-0.01
target_encoded
-0.01
lang_pct_adjective
-0.01
lang_pct_noun_plural
-0.01
lang_pct_determiner
-0.01
lang_pct_coordinating_conjunction
-0.01
lang_pct_verb_base_form
-0.01
customer_pk
-0.01
lang_pct_to_infinitive_preposition
-0.01
lang_pct_verb_3rd_person_sing_present
-0.01

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

is_upper
0.08
is_title
0.04
lang_ls_alnum
0.02
target
0.02
lang_ls_fs
0.01
style_head_foot
0.01
style_list_num
0.01
lang_ls_qm
0.01
begins_with
0.01
para_foll_depth_ind
0.01
para_prec_depth_ind
0.01
lang_ls_clscl
0.01
style_heading
0.01
lang_ls_brkt
0.01
MOST FREQUENT VALUES

0.0
99,917
>99.9%
1.0
46
<0.1%
0.3333333333333333
3
<0.1%
0.2
2
<0.1%
0.1
2
<0.1%
0.07142857142857142
2
<0.1%
0.02631578947368421
2
<0.1%
0.25
2
<0.1%
0.05555555555555555
1
<0.1%
0.00031826861871419476
1
<0.1%
0.00023282887077997672
1
<0.1%
0.16666666666666666
1
<0.1%
0.0004854368932038835
1
<0.1%
0.07692307692307693
1
<0.1%
0.14285714285714285
1
<0.1%
SMALLEST VALUES

0.0
99,917
>99.9%
0.00021150592216582064
1
<0.1%
0.00023282887077997672
1
<0.1%
0.00023496240601503758
1
<0.1%
0.00025608194622279127
1
<0.1%
0.00026518164942985947
1
<0.1%
0.00026567481402763017
1
<0.1%
0.00031826861871419476
1
<0.1%
0.00044483985765124553
1
<0.1%
0.0004854368932038835
1
<0.1%
0.0005299417064122947
1
<0.1%
0.0006872852233676976
1
<0.1%
0.001321003963011889
1
<0.1%
0.005154639175257732
1
<0.1%
0.010309278350515464
1
<0.1%
LARGEST VALUES

1.0
46
<0.1%
0.3333333333333333
3
<0.1%
0.25
2
<0.1%
0.2
2
<0.1%
0.16666666666666666
1
<0.1%
0.14285714285714285
1
<0.1%
0.125
1
<0.1%
0.1
2
<0.1%
0.09090909090909091
1
<0.1%
0.07692307692307693
1
<0.1%
0.07142857142857142
2
<0.1%
0.05555555555555555
1
<0.1%
0.041666666666666664
1
<0.1%
0.03125
1
<0.1%
0.02631578947368421
2
<0.1%